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USE CMP ri -1,4^LUCAN LYASE FOR PREPARATION OP l>I>.ANHYDROFRUCTOSE 

The present invention relates to the use of an enzyme, in particular a-l ,4-glucan lyase 
("GL"), to prepare 1,5-D-anhydrofructose ("AF") from substrates based on a-1,4- 
5 glucan. 

The present invention also relates to the use of a sugar, in particular 1,5-D- 
anhydrofructose ("AF"), as an anti-oxidant, in particular as an anti-oxidant for food 
stuffs and beverages. 

10 

The present invention relates to the use of 1,5-D-anhydrofructose ("AF") as a 
sweetener, in particular as a sweetener for foodstuffs and beverages, preferably 
human foodstuffs and beverages. 

15 FR-A-2617502 and Baute et al ui Phytochemistry [1988] vol. 27 No. 1 1 pp3401-3403 
rqport on the production of AF in Morchella vulgaris by an apparent enzymatic 
reaction. The yield of production of AF is quite low. Despite a refermce to a 
possible oizymatic reaction, neither of these two documents presents any amino acid 
sequence data for any ^yme let alone any nucleotide sequence information. These 

20 documents say that AF can be a precursor for the preparation of the antibiotic pyrone 
microthedn. 

Yu et al in Biochimica et Biophysica Acta [1993] vol 1156 pp3 13-320 report on the 
preparation of GL from red seaweed and its use to degrade Q:-l,4-glucan to produce 
25 AF. The yield of production of AF is quite low. Despite a reference to the enzyme 
GL this document does not present any amino acid sequence data for that enzyme let 
alone any nucleotide sequence information coding for the same. This document also 
suggests that the source of GL is just algal, 

30 A typical a-l,4-glucan based substrate is starch. Today, starches have found wide 
uses in industry mainly because they are cheap raw materials. 
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Starch degrading enzymes can be grouped into various categories. The starch 
hydrolases produce glucose or glucose-oligomers. A second group of ^starch 
degrading enzymes are phosphorylases that produce glucose- 1 -phosphate from starch 
in the presence of inorganic phosphate, 

5 

AF has also been chemicaUy synthesised - see the work of Lichtenthaler in 
Tetrahedron Letters Vol 21 pp 1429-1432. However, this chemical synthesis involves 
a large number of steps and does not yield large quantities of AF. 

10 The chemical synthetic route for producing AF is therefore very expensive. 

There is therefore a need for a process that can prepare AF in a cheap and easy 
manner and also in a way that enables large quantities of AF to be made. 

IS Furthermore, anti-oxidants are typically used to prevent oxygen having any 
deleterious effect on a substance such as a foodstuff. Two commonly used anti- 
oxidants are GRINDOX 142 and GRINDOX 1029. These anti-oxidants contain many 
components and are quite expensive to make. 

20 There is therefore a need to have a simpler and cheaper form of anti-oxidant. 

Furthermore, sweeteners are often used in the preparation of foodstuffs and 
bevemges. However, many sweeteners are expensive and complex to prepare, 

25 There is therefore a need to have a simpler and cheaper form of sweetener. 

According to the present invention there is provided a method of preparing the sugar 
1,5-D-anhydrofmctosc comprising treating an a-l,4-glucan with the enzyme a-1,4- 
glucan lyase characterised in that enzyme is used in substantially pure form. 

30 

Preferably if the glucan contains links other than and in addition to the a-1,4- links 
the Qr-l,4-glucan lyase is used in conjunction with a suitable reagent that can break 
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the other links - such as a hydrolase - preferably glucanohydroiasc. 

Preferably the glucan is starch or a starch fraction prepared chemically or 
enzymatically. If prepared enzymatically the reaction can be performed before the 
S addition of the a-1 ,4-glucan lyase or the reactions can be performed simultaneously. 
The suitable reagent can be an auxiliary enzyme. Preferred auxiliary enzymes are 
alpha- or beta-amylases. Preferably a debranching enzyme is used. More preferably 
the auxiliary enzyme is at least one of puUanase or isoamylase. 

10 Preferably the Qr-l,4-glucan lyase either is bound to a support or, more preferably, 
is in a dissolved form. 

Preferably the enzyme is isolated from either a fungus, preferably Morchella costata 
or Morchella vulgariSp or from a jfungally infected algae, preferably Gracilariopsis 
15 lemaneiformis, or from algae lone, preferably Gracilariopsis lemaneiformis. 

Preferably the enzyme is isolated and/or further purified from the fiingus or from the 
fimgally infected algae or algae alone using a gel that is not degraded by the enzyme. 

20 Preferably the gel is based on dextrin or d^vatives thereof. 

Preferably the gel is a cyclodextrin * more preferably beta-cyclodextrin. 

Preferably the mzyme comprises the amino acid sequence SEQ. ID. No. 1. or the 
25 amino acid sequence SEQ. ID. No. 2 or the amino acid sequence SEQ. ID. No. S or 
the amino acid SEQ. I.D. No. 6, or any variant thereof. 

In an alternative preferable embodiment, the enzyme comprises any one of the amino 
acid sequences shown in SEQ. ID. No.s 9 - 11, or any variant thereof. 

30 



The term "any variant thereof means any substitution of, variation of, modification 
of, replacement of, deletion of or addition of an amino acid from or to the sequence 
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providing the resultant enzyme has lyase activity. 

Preferably the enzyme is used in combination with amylopectin or dextrin. 

S Preferably^ the enzyme is obtained from the expression of a nucleotide sequence 
coding for the enzyme. 

Preferably the nucleotide sequence is a DNA sequence. 

10 Preferably the DNA sequence comprises a sequence that is the same as, or is 
complementary to, or has substantial homology with, or contains any suitable codon 
substitutions for any of those of, SEQ. ID. No. 3 or SEQ. ID. No. 4 or SEQ. ID. 
No, 7 or SEQ. ID. No. 8. 

IS In an alternative preferable embodiment, the DNA sequence comprises any one of the 
sequraces that are the same as, or are complementary to, or have substantial 
homology with, or contain any suitable codon substitutions as shown as SEQ. ID. 
No.s 12 - 14. 

20 The expression "substantial homology** covers homology with respect to structure 
and/or nucleotide components and/or biological activity. 

The expression "contains any suitable codon substitutions" covers any codon 
replacement or substitution with another codon coding for the same amino acid or any 
25 addition or removal thereof providing the resultant enzyme has lyase activity. 

In other words, the preset invention also covers a modified DNA sequence in which 
at least one nucleotide has been deleted, substituted or modified or in which at least 
one additional nucleotide has been inserted so as to encode a polypeptide having the 
30 activity of a glucan lyase, preferably having an increased lyase activity. 
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Preferably the starch is used in high concentration - such as up to about 25% 
solution, 

Preferably the substrate is treated with the enzyme in the presence of a buffer. 

5 

More preferably the substrate is treated with the enzyme in the presence of 
substantially pure water. 

Preferably the substrate is treated with the enzyme in the absence of a co-factor. 

10 

According to the preset invention there is also provided a method of preparing the 
sugar l,S-D-anhydrofructose comprising treating an a-l94'-glucan with the enzyme tt- 
1,4'glucan lyase characterised in that enzyme comprises the amino acid sequence 
SEQ. ID. No. 1. or the amino add sequence SEQ. ID. No. 2 or the amino add 
IS sequence SEQ. ID. No. 5. or the amino add sequence SEQ. ID. No. 6, or any one 
of the amino acid sequences SEQ. I.D. No.s 9-1 1, or any variant thereof. 

According to the preset invention there is also provided the sugar 1,5-D- 
anhydrofructose when prq>ared by the method of the present invention. 

20 

AF prepared by the present method was confirmed and characterised by ^^C NMR. 

One of key advantages of the present method is that the sugar 1 ,5-D-anhydrofructose 
can be prepared in much larger quantities than before and by a method that is 
2S rdatively easier and cheaper than the known processes. For example the sugar can 
now be prepared in amounts of for example greater than lOOg - such as 500g - 
compared to the prior art methods when only much smaller amounts were and could 
be produced - such as micro gram amounts* 
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Typical reactions that can be catalyzed by GL can be summarised as follows: 

1). Amylopectin ►AF + limit dextrin 

5 2). Amylose » AF -h limit dextrin 

3). Dextrin »AF 4- glucose 

In reaction 1), the ratio of the two products depend on the structure of amylopectin 
10 or the distribution of of-l,6-gIucosidic linkages in the amylopectin molecules. 

In reaction 2) and 3), the ratio of the products dq)ends on the degree of 
polymerisation (DP) number of the substrate. In reaction 3 the ratio between AF and 
glucose depends upon the DP. For example if the dextrin contains 10 glucose units 
15 the ratio AF:glucose would be 9: 1 . 

Another advantage of the present invention is that glucans that contain links other 
than a-1,4- links can be substantially degraded - whereas before only partial 
degradation was achieved. The substantial degradation of the l,S-D-anhydrofructose 
20 precursor is one of the factors leading to the increased yields of 1,S-D- 
anhydrofhictose. 

Other advantages are AF is a naturally occurring substance and therefore it has a 
potential for human purposes. For example, it can be converted to the antibiotic 
25 microthecin by AF dehydrase. Antibiotics are known for their uses in food bio- 
preservation, which is an important area in food technology. However, to date, the 
preparation of AF and also microthecin has had a number of disadvantages. For 
example, only small quantities could be produced. Also, the process was costly. 

30 The present invention overcomes these problems by providing a larger production of 
and much cheaper production of AF and so also other products such as microthecin. 
In this regard, it is possible to prepare gram to kilogram amounts of AF. 
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A further advanatge is that the lyase is stable for at least one year at 4"C and can be 
lyophilized without loss of activity. 

Another advantage is that the lyase produces AF directly from starches and does not 
5 need the presence of any co-factors. 

Another advantage is that the enzyme can be used in pure water. This result is very 
surprising. 

10 Based on the simple properties of the present lyase, one can expect that the 
production cost of AF will be comparable to that of glucose. This is especially 
advantageous that the present lyase does not necessarily require the presence of any 
co-factors which aie generally very expensive. 

15 In general a-l,4-glucans can be used as substrate for the enzyme. 

« 

As a preferred substrate, starch is used. 

In a preferred process, soluble or gelatinized starch or starch hydrolysate axe used. 
20 The starch hydrolysates can be prepared either chemically or enzymadcally . 

If an enzyme is used for the partial starch degradation the enzyme can either be added 
before the addition of the lyase or any other additional starch degrading reagent (such 
as the enzyme glucanohydrolase) which may be added simultaneously. 

25 

The lyase will convert the glucan to AF. The enzyme will attach the substrate from 
the non reducing end and leave only the reducing sugar unconverted. The residual 
glucose can be removed by known methods some of which have been described here. 



30 



Using the reaction described here pure AF can be produced and also in large 
amounts. 
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In one embodiment, the a-l,4-giucaii lyase is purified from the fungally infected 
algae - such as Gracilariopsis lemaneiformis - by affinity chromatography on /3- 
cyclodextrin Sepharose, ion exchange chromatography on Mono Q HR 5/5 and gel 
filtration on Superose 12 columns. The purified enzyme produces l,5~anhydro-D- 
5 fructose from a- 1 ,4-glucans. 

The fungal lyase isolated from fungal infected Gracilariopsis lemaneiformis is 
characterized as having a pH optimum at 3.5-7,5 when amylopectin is used, a 
temperature optimum at 50°C and a pi of 3.9. 

10 

In another embodiment, the a-I,4-glucan lyase is purified from the fungus Morchella 
costata by affinity chiomatogr^hy on i3-cyclodextrin Sepbaiose, ion exchange 
chromatography on Mono Q HR 5/5 and gel filtration on Superose 12 columns* The 
purified enzyme produces 1,5-anhydro-D-ihictose from a-l,4-glucans. 

15 

The fungal lyase shows a pi around 5.4 as determined by isoelectric focusing on gels 
^th pH gradient of 3 to 9. The molecular weight determined by SDS-PAGE on 8- 
25% gradient gels was 110 kDa. The enzyme (^bited a pH optimum in the range 
pH 5-7. The temp^ture cqptimum was found to be between 30-45 ^C. 

20 

In another embodiment, the a-l,4-glucan lyase is purified from the fungus Morchella 
vulgaris by affinity chromatography on j3-cyclodextrin Sepharose, ion exchange 
chromatography on Mono Q HR 5/5 and gel filtration on Superose 12 colunms. The 
purified enzyme produces 1,5-anhydro-D-fructose from a-l,4-glucans. 

25 

In another embodiment, the a-l,4-glucan lyase is purified from algae - such as 
Gracilariopsis lemaneiformis - by affinity chromatography on ^-cyclodextrin Sepharo- 
se, ion exchange chromatography on Mono Q HR 5/5 and gel filtration on Superose 
12 columns* The purified enzyme produces 1,5-anhydro-D-fructose from a-1,4- 
30 glucans. 
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Typical pH and temperature optima for the lyase catalyzed reaction for some of the 
GL enzymes according to the present invention are as follows: 



5 GL sources Optimal pH Optimal pH range Optimal temperature 
M. costma 6.5 5.5-7.5 37 C; 40 C 



M. vulgaris 6.4 5.9-7.6 43 C; 48 C* 
10 

Fungal infected Gracilariopsis 

lemaneiformis 3.8 3.7-4.1 40 C; 45 C* 



Parameters det^mined using glycogen as substrate; other parameters detennined 
15 using amylopectin as substrate. 

The enzymes of the present invention convert amylose and amylopectin to 1,5- 
anhydrofnictose. 

20 Among the maltosacchaiides tested^ we found that the lyase showed low activity 
towards maltose, and lower activity to maltotriose and maltoheptaose with the highest 
activity to maltotetraose and maltopentaose. The enzyme showed no substrate 
inhibition up to a concentration 10 mg ml'^ among these maltosaccharides. 

25 The enzymes from each of the preferred sources has been sequenced and the amino 
acid sequences are presented later. Also presented later are the DNA sequences 
coding for the enzymes. 



30 



The present invention therefore describes a new starch degrading enzyme - namely 
a new a-l^^-glucan lyase. This is an enzyme that has been purified and characterized 
for the first time. 
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As mentioned above, the present invention also relates to some specific uses of AF. 

In particular, the present invention relates to the use of 1,5-D-anhydrofructose 
("AF"), as an anti-oxidant, in particular as an anti-oxidant for food stuffs and 
5 beverages. 

Therefore according to the present invention there is provided the use of 1,5-D- 
anhydrofhictose (AF) as an anti-oxidant. 

10 Pief^^ably AF is or is used in an edible substance. 

Preferably AF is used in or as a foodstuff or beverage. 

Prefnably, AF is used in combination with anoth^* anti-oxidant. 

15 

Preferably the AF is prqiared by the method according to the present invention. 

The main advantages of using AF as an anti-oxidant are that it is a natural product, 
it is non-metabolisable, it is easy to manufacture, it is water-soluble, and it is 
20 generally non-toxic. 

In a preferred embodiment the present invention therefore relates to the enzymatic 
preparation of pure AF which can be used as an attractive water soluble antioxidant 
for food and non-food purposes. In the application examples arc given for the use 
25 of AF as an antioxidant in food formulations. 

In the accompanying examples it is seen that AF is comparable with known high 
quality commercial available food antioxidants. 

30 Non-food examples include use in polymer chemistry as oxygen scavengers during 
the synthesis of polymers. Also, AF could be used for the synthesis of bio- 
degradable plastic. 
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Experiments have shown that AF can be an efficient reducing agent (antioxidant), as 
it can easily reduce 3,5-dinitrosalicylic acid to 3-aniino-5-nilrosalicylic acid. 

AF is a naturally occurring substance and therefore it has a tremendous potential for 
5 use as an acceptable antioxidant. AF can also be converted into the antibiotic 
microthecin by AF dehydrase. Antibiotics are known for their uses in food 
biopreservation, an important area in food biotechnology. 

In another aspect, the present invention also relates to the use of 1,S-D- 
10 anhydrofhictosc as a sweetener, in particular as a sweetraer for foodstuffs and 
beverages, preferably human foodstuffs and bev^^es. 

Thus according to this aspect of the present invention these is provided the use of 1 ,5- 
D-anhydrofhictose as a sweetener. 

15 

Preferably the AF is used as or in a human foodstuff or beverage. 

The AF may be used in any desired amount such as a 5% soution or lOOmg/kg to 
500mg/kg. 

20 

The advantages of using AF as a sweetener are that it is a natural product, it is 
generally non-toxic, it is water soluble, it is non-metabolisable and it is easy to 
manufacture. 

25 The present invention therefore also relates to a novel application of AF as a 
sweetener. 

Preferably the AF is prepared by the method according to the present invention. 



30 Further aspects of the present invention include: 
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a method of preparing the enzyme a-I,4-glucan lyase (GL) comprising 
isolating the enzyme from a fungally infected algae, fungus or algae alone; 

an enzyme comprising the amino acid sequence SEQ. ID. No. 1 . or SEQ. ID. 
5 No. 2 or SEQ. ID, No. 5. or SEQ. ID. No. 6, or any variant thereof; 

an enzyme comprising the amino acid sequence SEQ. ID. No. 9. or SEQ. ID. 
No. 10 or SEQ. ID. No. 11, or any variant thereof; 

10 a nucleotide sequence coding for the enzyme a-l,4-glucan lyase, preferably 

wherein the sequence is not in its natural enviroment (i.e. it does not form part of the 
natural genome of a cellular organism capable of expressing the enzymie, preferably 
wherdn the nucleotide sequence is a DNA sequence; 

15 a nucleotide sequence wherein the DNA sequence comprises at least a 

sequence that is the same as, or is complementary to, or has substantial homology 
with, or contains any suitable codon substitutions for any of those of, SEQ. ID. No. 
3 or SEQ. ID. No. 4 or SEQ. ID. No. 7 or SEQ. ID. No. 8, preferably wherein the 
sequence is in isolated form; 

20 

a nucleotide sequence wherein the DNA sequence comprises at least a 
sequence that is the same as, or is complementary to, or has substantial homology 
with, or contains any suitable codon substitutions for any of those of, SEQ. ID. No. 
12 or SEQ. ID. No. 13 or SEQ. ID. No. 14, preferably wherein the sequence is in 
25 isolated form; and 

the use of beta-cyclodextrin to purify an enzyme, preferably GL. 

Other preferred embodiments of the present invention include any one of the 
30 following: A transformed host organism having the capability of producing AF as 
a consequence of the introduction of a DNA sequence as herein described; such a 
transformed host organism which is a microorganism - preferably wherein the host 
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organism is selected from the group consisting of bacteria, moulds, fungi and yeast; 
preferably the host organism is selected from the group consisting of Saccharomyces, 
Kluyveromyces, Aspergillus, Trichoderma Hansenula, Pichia, Bacillus Streptomyces, 
Eschericia such as Aspergillus oryzae, Saccharomyces cerevisiae, bacillus sublilis, 
5 Bacillus amyloliquefascien, Eschericia coli. ; A method for preparing the sugar 1 ,5-D- 
anhydrofructosc comprising the use of a transformed host organism expressing a 
nucleotide sequence encoding the enzyme a-l,4-glucan lyase, preferably wherein the 
nucleotide sequence is a DNA sequence, preferably wherein the DNA sequence is one 
of the sequences hereinbefore described; A vector incorporating a nucleotide sequence 

10 as hereinbefore described, preferably wherdn the vector is a replication vector, 
preferably wherein the vector is an expression vector containing the nucleotide 
sequence downstream from a promoter sequence, preferably the vector includes a 
marker (such as a resistance marker); Cellular oiganisms^ or cell line, transformed 
with such a vector; A method of producing the product a-l,4-glucan lyase or any 

15 nucleotide sequence or part thereof coding for same, which comprises culturing such 
an organism (or cells from a cell line) transfected with such a vector and recovering 
the product. 

In particular, in the expression systems, the enzyme should preferably be secreted to 
20 ease its purification. To do so the DNA encoding the mature enzyme is fused to a 
signal sequence, a promoter and a terminator from the chosen host. 

For expression in Aspergillus niger the gpdA (from the GIyceraldehyde-3-phosphate 
dehydrogenase gene of Aspergillus nidulans) promoter and signal sequence is fused 

25 to the 5' end of the DNA encoding the mature lyase. The terminator sequence from 
the A. niger trpC gene is placed 3' to the gene (Punt, P.J. et al 1991 - (1991): J. 
Biotech. 17, 19-34). This construction is inserted into a vector containing a 
replication origin and selection origin for E. coli and a selection marker for ^. niger, 
ETiamples of selection markers for A. rdger are the amdS gene, the argB gene, the 

30 pyrG gene, the hygB gene, the BmlR g^e which all have been used for selection of 
transformants. This plasmid can be transformed into A. niger and the mature lyase 
can be recovered from the culture medium of the transformants. Eventually the 
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construction could be transformed into a protease deficient strain to reduce the 
proteolytic degradation of the lyase in the culture medium (Archer D,B. ct all992 - 
Biotechnol. Utt. 14, 357-362). 

5 Instead of Aspergillus niger as host, other industrial important microorganisms for 
which good expression systems are known could be used such as: Aspergillus oryzae, 
Aspergillus sp,, Trichoderma sp,, Saccharomyces cerevisiae, Kluyveromyces sp., 
Hansenula sp,, Pichia sp.. Bacillus subtilis, B. amyloliquefaciens, Bacillus sp., 
Streptomyces sp. or £. colL 

10 

The following samples were deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom, 
AB2 IRY on 20 June 1994: 

15 

KColi containing plasmid pGLl (NCIMB 40652) - [ref. DHSalpha-pGLl]; and 

E.Coli containing plasmid pGL2 (NCIMB 40653) - [ref. DH5alpha-pGL2]. 

20 The following sample was accepted as a deposit in accordance with the Buds^st 
Treaty at the recognised depositary The Culture Collection of Algae and Protozoa 
(CCAP) at Dunstaffnage Marine Laboratory PO Box 3, Oban, Argyll, Scotland, 
United Kingdom, PA34 4AD on 1 1 October 1994: 

25 Fungally infected Gracilariopsis lemaneiformis (CCAP 1373/1) - [ref, GLQ-1 
(Qingdao)]. 

Thus highly preferred embodiments of the present invention include a GL enzyme 
obtainable from the expression of the GL coding sequences present in plasmids that 
30 are the subject of either deposit NCIMB 40652 or deposit NCIMB 40653; and a GL 
enzyme obtainable from the fungally infected algae that is the subject of deposit 
CCAP 1373/1. 
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The following samples were deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom, 
AB2 IRY on 3 October 1994: 

5 

E.Coli containing plasmid pMC (NCIMB 40687) - [ref, DHSalpha-pMC]; 
KColi containing plasmid pMVl (NCIMB 40688) - [ref. DH5alpha-pMVl]; and. 
10 RColi containing plasmid pMV2 (NCIMB 40689) - [ref. DH5alpha-pMV2]. 

Plasmid pMC is a pBluescript II KS containing a 4.1 kb fragment isolated from a 
genomic libiary constructed from Morchella costata. The fragment contains a gene 
coding for a-l,4-glucan lyase. 

15 

Plasmid pMVl is a pBluescript II KS containing a 2.45 kb fragment isolated from a 
genomic library constructed from Morchella vulgaris. The fragment contains the 5^ 
end of a gene coding for a-l,4-glucan lyase. 

20 Plasmid MV2 is a pPUC19 containing a 3.1 kb fragmwit isolated from a genomic 
library constructed from Morchella vulgaris. The fragment contains the 3' end of a 
gone coding for a-l,4*glucan lyase. 

In the following discussions, MC represents Morchella costata and MV represents 
25 Morchella vulgaris. 

As mentioned, the GL coding sequence from Morchella vulgaris was contained in two 
plasmids. With reference to Figure 15 pMVl contains the nucleotides from position 
454 to position 2902; and pMV2 contains the nucleotides downstream from (and 
30 including) position 2897. With reference to Figures 12 and 13, to ligate the coding 
sequences one can digest pMV2 with restriction enzymes EcoRI and BamHI and then 
insert the relevant fragment into pMVl digested with restriction enzymes EcoRI and 
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BamHL 

Thus highly preferred embodiments of the present invention include a GL enzyme 
obtainable from the expression of the GL coding sequences present in plasmids that 
5 are the subject of either deposit NCIMB 40687 or deposit NCIMB 40688 and deposit 
NCIMB 40689. 

The following sample was also accepted as a dqx)sit in accordance with Ihe Budapest 
Treaty at the recognised dqxisitary The Culture Collection of Algae and Protozoa 
10 (CCAP) at Dunstaffhage Marine Laboratory PO Box 3, Oban, Argyll, Scotland, 
United Kingdom, PA34 4AD on 11 October 1994: 

Fungally infected Gracilariopsh lemaneifomis (CCAP 1373/2) - [lef. GLSC-1 
(California)]. 

15 

Thus a highly preferred embodiment of the present invention includes a GL enzyme 
obtainable from the algae that is the subject of deposit CCAP 1373/2. 

The present invention will now be described only by way of example. 

20 

In the following Examples reference is made to the accompanying figures in which: 
Figure 1 shows stained fungally infected algae; 
25 Figure 2 shows stained fungally infected algae; 
Figure 3 shows sections of fungal hypha; 
Figure 4 shows sections of fungally infected algae; 

30 

Figure 5 shows a section of fungally infected algae; 
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Figure 6 shows a plasmid map of pGLl; 

Figure 7 shows a plasmid map of pGL2; 

5 Figure 8 shows the amino acid sequence represented as SEQ. I.D. No.3 showing 
positions of the peptide fragments that were sequmced; 

Figure 9 shows the alignment of SEQ. I.D. No. 1 with SEQ. I.D. No.2; 

10 Figurc 10 is a microphotograph; 

Figure 11 shows a plasmid map of pMC; 
Figure 12 shows a plasmid map of pMVl; 

15 

Figure 13 shows a plasmid map of pMV2; 

Figure 14 shows the GL coding sequence and part of the 5' and 3* non-translated 
regions for genomic DNA obtained from Morchella costata; 

20 

Figure 15 shows the GL coding sequence and part of the 5' and 3' non-translated 
r^ons for genomic DNA obtained from Morchella vulgaris; 

Figure 16 shows a comparison of the GL coding sequences and non-translated regions 
25 from Morchella costata and Morchella vulgaris; 

Figure 17 shows the amino acid sequence represented as SEQ. LD. No. 5 showing 
positions of the p^tide fragments that were sequenced; 

30 Figure 18 shows the amino acid sequence represented as SEQ. I.D. No. 6 showing 
positions of the peptide fragments that were sequenced; 
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Figure 19 shows a graph of oxygen consumption with and without the presence of 
AF; and 

Figure 20 shows a TLC plate. 

5 

In more detail, Figure 1 shows Calcoflour White siainings revealing ftingi in upper 
part and lower part of Gracilariopsis lemaneifomis (108x and 294x). 

Figure 2 shows PAS/Anilinblue Black staining of Gracilariopsis lenumeiformis with 
10 fungi. The fungi have a significant higher content of carbohydrates. 

Figure 3 shows a micrograph showing longitudinal and grazing sections of two thin- 
walled fungal hypha (f) growing between thick walls (w) of algal cells. Note 
thylacoid membranes in the algal chloroplast (arrows). 

15 

Figure 4 shows the antismse detections with clone 2 probe (upper row) appear to be 
restricted to the fungi illustrated by Calcoflour White staining of the succeeding 
section Oower row) (46x and lOBx). 

20 Figure 5 shows intense antisense detections with clone 2 probe are found over the 
fungi in Gracilariopsis lenumeiformis (294x)« 

Figure 6 shows a map of plasmid pGLl - which is a pBluescript II KS containing a 
3.8 kb fragment isolated from a genomic library constructed from fungal infected 
25 Gracilariopsis lemaneiformis. The fragment contains a gene coding for alpha- 1,4- 
glucan lyase. 

Figure 7 shows a map of plasmid pGL2 - which is a pBluescript II SK containing a 
3.6 kb fragment isolated from a genomic library constructed from fungal infected 
30 Gracilariopsis lemaneiformis. The fragment contains a gene coding for alpha- 1,4- 
glucan lyase. 
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Figure 9 shows the alignment of SEQ. I.D. No. 1 (GLl) with SEQ. LD. No.2 
{GL2). The total number of residues for GLI is 1088; and the total number of 
residues for GL2 is 1091 . In making the comparison, a structure-genetic matrix was 
used (Open gap cost: 10; Unit gap cost: 2). In Figure 9 the character to show that 
5 two aligned residues are identical is and the character to show that two aligned 
residues are similar is \\ Amino acids said to be 'similar* are: A,S,T; D,E; N,Q; 
R,K; I,L,M,V; F,Y,W. Overall there is an identity of 845 amino acids (i.e. 
77.67%); a similarity of 60 amino acids (5.51%). The number of gaps inserted in 
GLl are 3 and the number of gaps inserted in GL2 are 2. 

10 

Figure 10 is a microphotograph of a fungal hypha (f) growing between the algal walls 
(w). Note grains of floridean starch (s) and thylakoids (arrows) in the algal cell. 

In Figure 14^ the total number of bases is 4726 - and the DNA sequence composition 
15 is: 1336 A; 1070 C; 1051 G; 1269 T. The ATG start codon is shown in bold. The 
introns are underlined. The stop codon is shown in italics. 

In Figure 15, the total number of bases is 4670 - and the DNA sequence composition 
is: 1253 A; 1072 C; 1080 G; 1265 T. The ATG start codon is shown in bold. The 
20 introns are underlined. The stop codon is shown in italics. 

In Figure 16, the two aligned sequences are those obtained from MC (total number 
of residues: 1066) and MV (total number of residues: 1070). The comparison matrix 
used was a structure-genetic matrix (Open gap cost: 10; Unit gap cost : 2). In this 
25 Figure, the character to show that two aligned residues are identical is ':'. The 
character to show that two aligned residues are similar is \*. The amino acids said 
to be 'similar' are: A,S,T; D,E; N,Q; R,K; I,L,M,V; F,Y,W. Overall there is: 
Identity: 920 (86.30%); Similarity: 51 (4.78%). The number of gaps inserted in MC 
is 1 and the number of gaps inserted in MV is 1. 

30 
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In the attached sequence listings: SEQ. I.D.No. 5 is the amino-acid sequence for GL 
obtained from Morchella costata; SEQ. I.D.No. 6 is the amino-acid sequence for GL 
obtained from Morchella vulgaris; SEQ. I.D. No. 7 is the nucleotide coding sequence 
for GL obtained from Morchella costata; and SEQ. LD.No. 8 is the nucleotide 
coding sequence for GL obtained from Morchella vulgaris. 

In SEQ. LD. No. 5 the total number of residues is 1066. The GL enzyme has an 
amino acid composition of: 



46 Ala 


13Cys 


25 His 


18 Met 


73Thr 


50 Arg 


37 Gin 


54 He 


43Phe 


23 Trp 


56 Asn 


SSGlu 


70 Leu 


56 Pro 


71 Tyr 


75 A^ 


89GIy 


71 Lys 


63Ser 


78 Val 



In SEQ.LD. No. 6 the total number of residues is 1070. The GL enzyme has an 
amino acid composition of: 



51 Ala 


13Cys 


22 His 


17 Met 


71 Thr 


50 Arg 


40 Gin 


57 He 


45 Phe 


24Trp 


62 Asn 


SSGlu 


74 Leu 


62 Pro 


69 Tyr 


74 Asp 


87Gly 


61 Lys 


55 Ser 


78 Val 



EXPERIMENTS 

1 THE SOLUBLE ENZYME SYSTEM: 

L 1. Effect of pH on the stability and activity of the lyase isolated from fiigal infected 
Gracilariopsis lemaneiformis. 

Two buffer systems, namely HOAc and NaOAc and sodium citrate - citric acid in a 
concentration of 5 mM - were tested at 3TC, The pH range tested was from pH 3 to 
pH 5.2. The lyase showed maximum activity in a pH range between 3.6 to 4.2. At 



wo 95/10616 



PCT/EP94/03397 



21 

pH 3, the stability and activity of the enzyme decreased by about 90%. At pH 5.2, 
the activity decreased by about 64%, However, the enzyme was considerably more 
stable at this pH than at pH 3, as the AF yield obtained at pH 5.2 was 75% of the 
AF yield obtained at pH 3.8. Slightly higher AF yield was obtained in the HOAc and 
5 NaOAc buffer than in citrate buffer. This is not due to any differential effect of the 
two buffers (final cone, is 125 in the AF assay mixture) in the AF assay method. 

1.2. Effect of temperature on the activity and stability of the lyase. 

10 This experiment was conducted at optimal pH range. At 25''C the production of AF 
was linear up to at least 9 days. This indicates that no loss of activity and stability of 
the lyase occurred within 9 days. With increasing temperature, the stability of the 
enzyme decreased. 

15 The half life of the enzyme activity at the following temperature was: 



20 



30"C 5 days 
37°C 2.5 days 

less than 1 day 
SO^C less than 1 day 



1.3. Effect of substrate concentration on the stability of the lyase and AF yield. 

It was observed that amylopectin and dextrins have a stabili;dng effect on the lyase 
25 while the smallest substrate maltose does not. This was verified for both the soluble 
enzyme system and the immobilized enzyme system. 

AF yield increases with the increase in amylopectin concentration up to 25%. In the 
case of dextrin, the AF yield decreases when the concentration acceeds 30% (30%, 
30 40% and 50% were tested). 
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1.4 Activation and inactivation of lyase 

No metal ions are found necessary for the activity and the enzyme catalysed reaction 
can surprisingly proceed in pure water. The fact that the addition of EDTA in the 
5 reaction mixture up to 20 mM had little effect on the activity clearly demonstrates 
that metal ions are not essential for the activity of the lyase enzyme according to the 
present invention. 

This means that in the AF purification step, the ion exchange chromatography step 
10 that takes away salts ftom the reaction system can be omitted, if water is used as 
reaction medium. However, inclusion of NaCl in the reaction mixture in a 
concentration of 0.85% (0.145 M) can increase the AF yield up to 1-fold. 

L5. Substrate Specifidty 

15 

Upon cooling solubilized starch will tend to form rigid gels when the starch 
concentration becomes to high. Therefore it is an advantage to utilize partly degraded 
starch as substrate for the 1,4-glucan lyase. 

20 The specificity of a-l,4-glucan lyase isolated from M. costata for different 
oligosaccharides was tested. The oligosaccharides were maltose (G2), maltotriose 
(G3), maltotetraose (G4), maltopentaose (OS), maltohexaose (G6) and maltoheptaose 
(G7). The oligosaccharides were dissolved in HjO at a concentration of 8 mg/ml. The 
enzyme assay contained 150 ftl substrate G2/G3/G4/G5/G6/G7, 120 id QAM MES 

25 pH 6.3 and 30 ^1 purified enzyme. The reaction mixture was incubated for 60 min 
at 30^. Aft^ivards the reaction was stopped by boiling for 3 min and 900 fil 
absolute ethanol was added for precipitation. After centrifugation at 20.000 x g for 
5 min at 4*X^ the supernatant was transferred to a new eppendorf tube and lyophilized. 

30 The freeze-dried samples were dissolved in 1000 /il HjO and were filtrated through 
a 0.22 /im Millipore filter before 25 ii\ of the sample was loaded on the Dionex 
HPLC. 
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L7 HPLC 

Analytical procedures. 

Analyses were performed on a Dionex 4500i chromatography system consisting of a 
S GPM-2 pump and a PED detector which was used in pulse amperometric detection 
miode. 

The anion exchange columns were a CaiboPac PA- 100 (4 x 250 mm) and a CaiboPac 
PA-100 guard colunui (3 x 25 mm) from Dionex. 

10 

The eluent were 200 mM sodium hydroxide (A), 500 mM sodium acetate (B) and 18 
M ohm de-ionized water (C) . The pump was programmed in 2 differoit ways, 
method no. 1 and method no. 2: 

15 Method no. 1: 



Time, min 


0.0 


3.0 


3.1 


26.9 


29.0 . j 


% A 


10 


10 


50 


50 


10 


1 


0 


0 


0 


32 


0 


1 ^ C 


90 


90 


50 


18 


90 



Method no. 2: 



Time, min. 


0.0 


30 


% A 


10 


10 


% B 


0 


0 


1 % c 


90 


90 
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Standard^ : 

Glucose, maltose, maltotriose, maltotetraose, nialtopentaose, maltohexaose and 
maltoheptaose (all from Sigma) and 1,5-anhydrofructose were used as standards. All 
5 compounds were dissolved in 18 M ohm de-ionized water which was filtered through 
a 0.22 lira Millipore filter before use. 

LT Rcsuhss 

10 The analyses show that the purified enzyme which was isolated from M. costata 
indeed was able to use maltooligosaccharides as substrate 1 for l^S-anhydrofructose 
formation. 

When maltose was used as substrate, almost no I^S-anhydrofiructose was formed but 
15 when the other maltooligosaccharides (G3-G7) were used, high amounts of this 
compound were produced. 

It is clear that higher amounts of l,S-anhydrofructose were obtained when a longer 
maltooligosaccharide was used. 

20 

This observation corresponds perfecdy well with the theory of the lyase forming 1 ,5- 
anhydrofructose from the non-ieducing end of the substrate, leaving only the tmninal 
glucose molecule unchanged. 

25 1,8 Formation of AF 

of-l,4-glucan lyase from M.costata hydrolyses starch to the end-product 1,5- 
anhydrofructose. The end-product was shown by HPLC, method 2. The enzyme assay 
contained 500 fil amylopectin (20 mg/ml, dissolved m H2O), 400 /ilOAM MES pH 
30 6.3 and 100 fil purified enzyme. The reaction mixture was incubated at 30^ and the 
reaction was stopped by boiling after 30 or 120 min incubation. High-molecular 
oligosaccharides were precipitated by addition of 3 vol abs. ethanol and the sample 
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was centrifuged and freeze-dried as described above. The samples were dissolved in 
125 /xl H2O and 25 fil were applied on the HPLC column. 

The HPLC elution profile clearly shows that a-l,4-giucan lyase from M.costata 
S produces 1,5-anhydrofructose by hydrolysis of starch. Equal amounts of 1,5- 
anhydrofructose were found after 30 and 120 min. incubation which indicate that the 
enzyme activity is not inhibited by the endproduct 1^5-anhydrofructose. 

^'C NMR spectra (water) of AF prepared in this way shows that it adopts one major 
10 form giving rise to the following signals: 5 93.5 (quart, C-2), 81.5 (CH, C-5), 77.7 
(CH, C-3), 72.6 (CH2, C-1), 69,8 (CH, C-4), 62.0 (CHj, 0-6). Assignments are 
based on H-H C-H and C-H 2D correlation spectra. 

1.6. The cooperative effect of lyase with pullulanase and isoamylase. 

15 

As it can be seen from Table 1, the inclusion of pullulanase in the reaction mixture 
will obviously increase the AF yield by about 15-23%, depending on whether soluble 
starch or amylopectin is used as substrate. 
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Table The cooperation of pullulanase and lyase in the production 
of AF. 



Substrate 


Lyase 


Pullulanase 


AF Yield (%) 


Glc Yield (% 
) 


Solubl . 
Starch 


+ 


+ 


51 
0 


0 

0.37 








66 0 




Amylo 
-pectin 


+ 




48.0 


0 






+ 


0 


0.33 




+ 


+ 


71.3 


3.7 


+, enzyme 


added. 


^ enzyme omitted. 







The reaction mixture contained 0.3 ml 2% potato amylopectin (Sigma) in water or 0.3 
ml 2% soluble starch (Merck), 2 /d lyase and 0.36 units pullulanase (BM) as 
indicated. 

5 

The reaction was carried out at 30°C for 1 day. At the end of the reaction, samples 
were taken for AF and Glc analysis. 

In the case of isoamylase, the advantage is that the optimal pH of the lyase overlaps 
10 with that of Pseudomonas isoamylase (pH 3.0-4.5). The problem, however, is that 
isoamylase will produce an excess amount of long chain amylose that precipitates 
from the solution, and therefore is no longer suitable as a substrate for the lyase. It 
can be expected that the cooperation of the lyase with isoamylase will be efficient, 
if the chain of amylose is not too long. 
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2. THE IMMOBILIZED ENZYME SYSTEM 

Immobilization of the lyase was achieved by using succinimide-activated Sepharose 
(Affigel 15 gel, Bio-Rad) and glutaradehye-activated Silica gel (BM). The recovery 
S of lyase activity after immobilization on Affigel 15 gel was between 40% to 50%. 
There may be some lyase that is still active after immobilization, but is inaccessible 
to the substrate because of the steric hindrance, especially in the case of 
macromolecules like starches. Immobilized enzymes used in the industry usually have 
an activity recovery of around 50%. 

10 

The most interesting thing of the Affigel 15 gel immobilized lyase is that its stability 
has been greatly improved at pH 5,5. When the column was operated at this pH, the 
stability was at least 16 days long. The pH shift in the stability is very important 
considering the optimal pH of puUulanase which is around pH 5.5. This is the 
15 prerequisite for the lyase and puUulanase to cooperate efficiently in the same reactor 
with the same physico-chemical environment. The soluble lyase has an optimal pH 
between 3.6 and 4.2, and at this pH range puUulanase shows little or no activity. 

With the silica gel immobilized lyase, the activity recovery is very high, around 80- 
20 100% . However, the silica gel immobilized enzyme was not stable when the colunm 
was operated neither at pH 3.8 nor pH 5.5. It is possible that some lyase was ad- 
sorbed on the surface of the silica gel beads and was slowly released from the silica 
gel after each washing of the column. It may therefore be the adsorbed lyase that 
contributes to the high recovery rate and the decrease in column activity. 

25 
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3. PURmCATION OF AF 

3.1. The lyase-Amylopectin/Soluble Starch System 

5 In this system, the reaction system contained AF, limit dextrin, the lyase, and buffer 
salts at the end of the reaction. AF ^vas sq^arated from the macromolecules (^mt 
dextrin and the lyase) by ethanol (final cone. 50%) precipitation. Unpredpitated low- 
molecular-weight amylopectin was sq)arated by ultrafiltration using Amicon YM3 
membranes (cut-off 3,000). Ethanol was removed by evaporation at 40^C in a rotary 
10 evaporator. Buffer salts were removed from AF by mixed ion exchangers. Purified 
solid AF was obtained by freeze-drying. 

3.2. The Lyase-PuUulanase/Amylopecdn/Soluble Starch System. 

15 In this system the final products are AF and glucose. If at least a substantially pure 
sample of AF is to be prepared, the by-product glucose must be removed. This can 
be achieved by enzyimAc methods. First the glucose is converted into gluconic acid 
and hydrogen peroxide by glucose oxidase. 

20 Catalase is needed to dispel H2O3 formed. H2O2 will oxidize AF into two new 
compounds which are at present of unknown structure. The other impurities in the AF 
prqiaration are the oxidation products of AF. It was observed that AF can slowly be 
oxidized by air-level of oxygen, especially at high temperature, high AF concentration 
and long time of exposure. 

25 

Gluconic acid was removed together with the buffer salts by ion exchange 
chromatography. 

In this system, the low-molecular-weight amylopectin molecules may alternatively be 
30 hydrolysed by amyloglucosidase instead of using ultrafiltration. 
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3.3. The purity checking of AF. 

The purity of the AF preparations were confirmed by TLC, Dionex and NMR. 

3.4 Analysis of the antioxidative activity of anhydro ihictose. 

Electrochemical oxygen consumption: 
Method. 

The activity of AF was investigated in a methyl linoleate emulsion as descrit)ed by 
Joigensen and Skibsted (Z. Lebensm. Unters. Forsch. (1993) 196: 423-429) with 
minor modifications: To 5.00 ml of a 1.33 mM methyl linoleate emulsion in 5.0 mM 
aqueous phosphate buffer with pH = 5.8 and 0,2 w/w % Tween 20 as emulsifier was 
added AF in the following concentrations: 0, 15, 146 and 680 fiM. The oxidation in 
the system was initiated by addition of 50 /xl 0.26 M metmyoglobin (MMb) final 
concentration 0.26 mM. Immediately after initiating the reaction the sample was 
injected to a thermostated (25.0 + O.rC) 70 /il closed cell, effectively excluding 
diffusion of oxygm into the system. The oxygen consumption was measured by a 
Gark electrode, which was connected to a PC data collection program. The relative 
oxygen concentration (%) was registered every 30s. 

Results. 

Curves corresponding to oxygen consumption for the different samples are illustrated 
in Figure 19. For samples without addition of AF a relative decrease in oxygen 
concentration is se^ immediately after injection of the sample. For samples 
containing AF a lag-phase is observed before the curve breaks off and the oxygen 
concenu^tion is reduced. After the lag-phase only a minor reduction in the oxygen 
consumption rate is observed compared to samples without AF added. A tendency for 
samples having the highest amount of AF to have the longest lag-phase is observed. 
As well the rate for oxygen consumption is lower for these samples, which is seen 
by a smaller slope of the curves compared to the slope for the references (0 /xM). 
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ESR analysis 
Method. 

Hydroxy! radicals were generated by a Fenton reaction with H2O2 (0.17 mM) and 
5 FeS04 (4.8 /xM). The generated radicals were trapped by 5,5-dimethyl-l-pynoline N- 
oxide (DMPO, 9.7 mM). AF was added in concentrations of L3 mM and 6.3 mM. 
A water soluble extract of rosemary (Rosmarinus qfficimlis L.) was analyzed in a 
concentration of 0.25 mg/ml (in grams equivalent to 1.26 mM AF). Measurements 
were carried out at room temperature (20 ± VC) after 120 s and repeated for the 
10 same reaction mixture after 300 s with the following spectrometer settings: Center 
field 3475.60 G; sweqp width 55 G; microwave power 20 mW; modulation frequency 
100 kHz; modulation amplitude 1.01 G; receiver gain 1.00 • 10^; conversion time 
81.92 ms time constant 163.84 ms and sweep time 83.89 s. 

15 Results. 

The generated hydroxyl radicals were trapped by DMPO. The spin adduct gives rise 
to a characteristic 1:2:2:1 ESR spectrum. The peak height of the spectrum is 
proportional to the quantitative amount of generated spin adduct. Addition of both 
20 DMPO and AF will set up a competition between the spin trap and AF. A reduction 
of peak height will indicate a good scavenging activity of AF. 
Table: Peak height of ESR-spectra, H2O2 = 0.17mM and Fe^*" = 4.8 ^M. 



Anhydro fructose [mM] 


Rosemary extract [mg/ml] 


Peak hdght 
[120 s] 


Peak height | 
[300 s] 


0 


0 


2475 


2780 


1.3 


0 


2634 


2545 


6.3 


0 


1781 


1900 
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At a concentration of 1 3 mM AF no scavenging activity of hydroxyl radicals is seen, 
at 6.3 mM Af the peak height is reduced, indicating that a part of the generated 
hydroxyl radicals is scavenged by AF. 

5 4. USE OF AF AS AN ANTI-OXIDANT 

EXAMPLE 4.1 

Use Pf AF 3S 9n witi-PXidant in ^ ?Q% maypm^se, 

10 

50% mayonnaise is used for salads, open sandwiches, etc. in both the catering and 
the retail trades. The low oil content of 50% mayonnaise makes it suitable for low- 
calorie applications. 

IS A typical mayonnaise composition is as follows: 



Soya oil 


50.0% 


Tarragon vinegar (10%) 


4.0% 


Egg yolk 


3.5% 


Sugar 


3.0% 


Salt 


1.0% 


Potassium soibate 


0.1% 


Water 


35.2% 


MAYODAN 602 


3.0% 


Lemon flavouring 10251 


0.2% 



MAYODAN 602 ensures a fine, stable oil dispersion and the required viscosity, 
thereby providing 50% mayonn^se with a long shelf life. 

30 Flavouring 10251 is a natural lemon flavouring which provides mayonnaise with the 
fresh taste of lemon. 
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Typically the mayonnaise is prepared by the following method: 

1) Dry mix the MAYODAN 602, sugar and salt. Disperse in oil in a ratio of 1 part 
powder to 2 parts oil. 

5 

2) Add flavouring and potassium sorbate to the water and pour into the Koruma 
mixer. Add 1). 

3) Add the egg yolk. 

10 

4) Add the oil continuously in a vacuum. 

5) After 2/3 of the oil has been added (slowly), blend the tarragon vinegar with the 
lemaining 1/3 of the cnl, and add. 

15 

The following data show that when AF is added to the mayonnaise as an anti-oxidant 
the results are comparable to the known food anti-oxididants GRINDOX 142 and 
GRINDOX 1029. 



20 GRINDOX 142: 

Ascorbyi palnutate 10% 

Propyl gallate 20% 

Citric acid 10% 

Food grade emulsifier 60% 

25 Form at 25"^ paste 

Colour grey to pale brown 

Density 1.1 g/ml 



(All percentages are by weight) 
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GRINDOX 1029: 

Ascorbyl palmitate 20% 

Natural tocopherols 20% 

Food grade emulsifier 60% 

5 Form at 25X paste 

Colour light brown 

Density at 25*C 1,0 g/ml 



(All percentages are by weight) 

10 In the test procedure the anti-oxidants were added to the mayonnaise to provide an 
anti-oxidant concentration in the order of about 500 ppm. The mayonnaise was then 
placed in a bomb calorimeter at temperature SOT containing pure Oj. An inductiom 
period to the onset of substantial oxidation of the product is then measured. 

IS The results were as follows. 



Samples: IP (hours) 

1. Blank 28,0 

2. -f 500 ppm GRINDOX 142 35,0 
20 3. + 500 ppm GRINDOX 1029 33,3 

4. + 550 ppm GRINDOX 1029 34,3 

5. + 500ppm 1,5 anhydro-D-fmctose 32,0 
(IP hours = Induction Period) 



25 



These results show that AF is an excellent food anti-oxidant and is comparable with 
the known foodstuffs anti-oxidants GRINDOX 142 or GRINDOX 1029. 
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EXAMPLE 4,2 

Use of AF as an anti-pxidant in a salad dressing 

5 YOGURT SALAD DRESSING WITH 50% OIL 

Yogurt salad dressing with 50% oil is used for salads, potatoes^ raw vegetable salad, 
meat, fish and boiled vegetables. 

10 Composition 



Soya oil 


50.0% 


Yogurt (jisan) 


39.0% 


Vinegar (10%) 


3.5% 


Sugar 


3.0% 


Egg yolk 


2.0% 


Salt 


1.0% 


Potassium sorbate 


0.1% 


MAYODAN 525 


1.4% 


Acid masking flavouring 2072 


0.02% 



MAYODAN 525 provides unique emulsion stability, prevents syneresis, ensures 
unifonn oil disp^sion and viscosity, improves tolerance to production processes and 
ensures a long shelf life. 

25 Flavouring 2072 is a r - :ure-identical, acid masking flavouring reducing the acidulated 
taste of dressing without affecting its pH value. 

Process 



30 



1; 



Dry mbc MAYODAN 525, sugar and salt. Disperse in oil in a ratio of 1 part 
powder to 2 parts oil. 
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2, Fill flavouring, potassium sorbate and yogurt into the Koruma mixer. Add 1). 

3. Add the egg yolk. 

5 4. Add the oil continuously in a vacuum. 

5. After 2/3 of the oil has been added (slowly), blend the vinegar with the 
remaining 1/3 of the oil, and add. 

10 6. Add spices if required. 
Test results: 

Sample: IP hours ££ 

1. Blank 37.2 1.00 

IS 2. 500 ppm anhydrofructose 39.S 1.06 

3. 800 ppm GRINDOX 1032 43.3 1.07 
(BP - Induction Period); (PF - Protection Period) 

Protection Factor fPF): 
20 For each temperature defined as 

PF == IP of the oil with added antioxidant/IP of the same oil without added 
antioxidant 



Mfp fixt<msiQn aE) %: 
25 LE = (PF - 1.0) X 100 
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6. PREPARATIONS OF Q-1.4-GLUCAN LYASE 
INTRODUCTION 

5 With regard to a further embodiments of the present invention the enzyme 

glucan lyase for use in preparing the AF may be isolated from a ftingally infected 
algae, preferably fungally infected Gracilariopsis lemaneiformiSy more preferably 
fungally infected Gracilariopsis lemaneiformis from Qingdao (China). 

10 Alternatively the enzyme may be obtained from a fungus. For example, the fungus 
can be any one of Discina perlata, Discina parma, Gyromtra gigas, Gyromitra 
infida, Mitrophora hybrida, Morchella conica, Morchella costaia, Morchella elata, 
Morchella honensis, Morchella rotunda, Morchella vulgaris, Peaza badia, 
Sarcosphaera eximia, Disciotis venosa, Gyrormtra esadetua, Helvetia crispa, Helvetia 

IS laamosa, Leptopodia elastica, Verpa digitaliformis, and other forms of Morchella. 
Preferably the fungus is Morchella costata or Morchella vulgaris. 

With regard to a further embodiment of the present mvention the enzyme 0-1,4- 
glucan lyase for use in preparing the AF may be isolated from algae alone, preferably 
20 Gracilariopsis lemaneiformis, more preferably Gracilariopsis lemaneiformis from 
Santa Cruz (California)* 

The initial enzyme purification can be performed by the method as described by Yu 
et al (ibid). However, preferably, the initial enzyme purification includes an 
25 optimized procedure in which a solid support is used that does not decompose under 
the purification step. This gel support further has the advantage that it is compatible 
with standard laboratory protein purification equipment. The details of this optimized 
purification strategy are given later on. The purification is terminated by known 
standard techniques for protein purification. 

30 

The purity of the enzyme can be readily established using complementary 
electrophoretic techniques. 
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A. SOURCE = FIJNGALLY INFECTED ALGAE 

The following sequence information was used to generate primers for the PGR 
reactions mentioned below and to check the amino acid sequence generated by the 
5 respective nucleotide sequences. 

Amino acid sequence assembled from peptides from fungus infe cted Gracilariop s^ 
lemaneiformis 

10 Tyr Arg Trp Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn Ala 
Ala Phe Gly Lys Pro He lie Lys Ala Ala Sct Met Tyr Asn Asn 
Asp Ser Asn Val Arg Arg Ala Gin Asn Asp His Phe Leu Leu Gly 
Gly ffis Asp Gly Tyr Arg Ue Leu Cys Ala Pro Val Val Trp Glu 
Asn Ser Thr Glu Arg Glu Leu Tyr Leu Pro Val Leu Thr Gin Trp 

15 Tyr Lys Phe Gly Pro Asp Phe Asp Thr Lys Pro Leu Glu Gly Ala 

The Amino acid sequence (27-34^ used to generate primer A and B (Met Tyr Asn 
Asn Asp Ser Asn Val^ 

20 Primer A 

ATG TACrC) AA(CT) AA(CT) GA(CI) TC(GATC) AA(CT) GT 128 mix 

Primer B 

ATG TA(TC) AA(CT) AA(CT) GA(CT) AG(CT) AA(CT) GT 64 mix 

25 

The Amino acid sequence (45-50) used to generate primer C (Gly Gly His Asp Gly 
Primer C 

30 TA (GATC)CC (GA)TC (GA)TG (GATC)CC (GATC)CC 256 mix 
[The sequence corresponds to the complementary strand J 
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The Amino acid sequence (74-79) used to generate primer E (Gig Trp Tyr Lys Phe 

om . ' 

Primer E 

5 GG(GATC) CC(GA) AA(CT) TT(GA) TAG CA(CT) TG 64 mix 
[The sequence corresponds to the complementary strand.] 

The Amino acid sequmce (1-6) used to generate primer Fl and F2 flyr Arp Trp Gin 
Glu Val> 

10 

Primer Fl 

TA(TQ CG(GATC) TOG CA(GA) GA(GA) GT 32 mix 
Primer F2 

15 TA(TC) AG(GA) TGG CA(GA) GA(GA) GT 16 mix 

The sequence obtained from the first PCR amplification (clone 

ATGTACAACA AOGACTCGAA CXHTCGCAGG GCGCAGAACG ATCATTTCCT 
20 TCTTGGCGGC CACGACGGTT A 

Met Tyr Asn Asn Asp Ser Asn Val Arg Arg Ala Gin Asn Asp His Phe Leu Leu Gly 
Gly His Asp Gly 

25 The sequence obtained from the second PCR amplification (clone 1^ 

ATGTACAACA ACGACTCGAA CGTTCGCAGG GCGCAGAACG ATCATTTCCT 
TCTTGGTGG A CATGATGGAT ATCGCATTCT GTGCGCGCCT GTTGTGTGGG 
AGAATTCGAC CGAACGNGAA TTGTACTTGC CCGTGCTGAC CCAATGGTAC 
AAATTCGGCC C 

30 Met Tyr Asn Asn Asp Ser Asn Val Arg Arg Ala Gin Asn Asp His Phe Leu Leu Gly 
Gly His Asp Gly Tyr Ai;g He Leu Cys Ala Pro Val Val Trp Glu Asn Ser Thr Glu 
Arg Glu Leu Tyr Leu Pro Val Leu Thr Gin Trp Tyr Lys Phe Gly Pro 
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The sequence obtained from the third PGR amplification fclonc2) 
TACAGGTGGC AGGAGGTGTT GTACACTGCT ATGTACCAGA 
ATGCGGCTTT CGGGAAACCG ATTATCAAGG CAGCTTCCAT 
GTACGACAAC GACAGAAACG TTCGCGGCGC ACAGGATGAC 
5 CACTTCCTTC TCGGCGGACA CGATGGATAT CGTATTTTGT 

GTGCACCTGT TGTGTGGGAG AATACAACCA GTCGCGATCT 
GTACTTGCCT GTGCTGACCA GTGGTACAAA TTCGGCCC 

Tyr Arg Tip Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn Ala Ala Phe Gly Lys 
10 Pro He He Lys Ala Ala Ser Met Tyr Asp Asn Asp Arg Asn Val Ai:g Gly Ala Gin Asp 
Asp His Phe Leu Leu Gly Gly His Asp Gly Tyr Arg He Leu Cys Ala Pro Val Val 
Tip Glu Asn Thr Thr Ser Arg Asp Leu Tyr Leu Pro Val Leu Thr Lys Tip Tyr Lys 
Phe Gly 

15 A.l, CYTOLOGICAL INVESTIGATIONS OF GRACILARIOPSIS 

LEMANEIPORMIS 

Detection of fungal infection in Gracilariopsis lemaneifomis 

20 Sections of Gracilariopsis lemaneiformis collected in China were either hand cut or 
cut from paraffin embedded material. Sectioned material was carefully investigated 
by Ught microscopy. Fungal hyphae were clearly detected in Gracilariopsis 
lemaneiformis. 

25 The thaUi of the Gracilariopsis lemaneiformis are composed of cells appearing in a 
highly ordo'ed and almost symmetric manner. The tubular thallus of G. 
lemaneiformis is composed of large, colourless central cells surrounded by dongated, 
slender, ellyptical cells and small, round, red pigmented peripherial cells. All algal 
cell types are characterized by thick cell walls. Most of the fungal hyphae are found 

30 at the interphase between the central layer of large cells and the peripherial layer. 
These cells can clearly be distinguished from the algae cells as they are long and 
cylindrical. The growth of the hyphae is observed as irregularities between the highly 
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ordered algae cells. The most frequent orientation of the hypha is along the main 
axis of the algal thallus. Side branches toward the central and periphery are detected 
in some cases. The hypha can not be confused with the endo/epiphytic 2nd generation 
of the algae. 

5 

Calcofluor White is known to stain chitin and cellulose containing tissue. The reaction 
with chitin requires four covalently linked terminal n-acetyl glucosamine residues. 
It is generally Bcceptcd that cellulose is almost restricted to higher plants although it 
might occur in trace amounts in some algae. It is further known that chitin is absent 
10 in Gracilaria. 

Calcofluor White was found to stain domains corresponding to fimgi hyfa cell walls 
in sectioned Gracilariopsis lemanelfomis material. 

IS The hypha appear clear white against a faint blue background of Gracilaria tissue 
when observed under u.v. light - see Figure L Chitin is the major cell wall 
component in most fungi but absent in Gracilaria. Based upon these observations we 
conclude that the investigated algae is infected by a fungi. 40% of the lower parts 
of the investigated Gracilariopsis lemaneiformis sections were found to be infected 

20 with fungal hyphae. In the algae tips 25% of the investigated Gracilariopsis 
lemaneiformis sections were found to be infected. 

Staining of sectioned Gracilariopsis lemaneiformis with Periodic acid Schiff (PAS) 
and Aniline blue black revealed a significantly higher content of carbohydrates within 
25 the fungal cells as compared with the algae cells - see Figure 2. Safranin O and 
Maladut Green showed the same colour reaction of fungi cells as found in higher 
plants infected with fungi. 

An Acridin Orange reaction widi sectioned Gracilariopsis lemaneiformis showed 
30 clearly the irregularly growth of the fungus. 
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A . 1 . 1 . 2 Electron Microscopy 

Slides with 15 fun thick sections, where the fungus was detected with Calcofluor 
White were fixed in 2% OSO4, washed in water and dehydrated in dimethoxypropane 
5 and absolute alcohol. A drop of a 1 : 1 mixture of acetone and Spun* resin was placed 
over each section on the glass slide, and after one hour replaced by a drop of pure 
lesin. A gelatin embedding capsule filled with resin was placed face down over the 
section and left over night at 4T. After the polymerization at SS^'C for 8 hrs, the 
thick sections adhering to the resin blocks could can be separated from the slide by 
10 immersion in liquid nitrogen. 

Blocks were trimmed and 100 nm thick sections were cut using a diamond knife on 
a miciotome. The sections were stained in aqueous uranyl acetate and in lead citrate. 
The sections were examined in an election microscope at 80 kV. 

15 

The investigation confirmed the ligth microscopical observations and provided further 
evidence that the lyase producing, Chinese strain of G. lamneiformis is infected by a 
fungal parasite or symbiont. 

20 Fungal hyphae are build of tubular cells SO to 100 fim long and only few microns in 
diameter. The cells are serially arranged with septate walls between the adjacent cells. 
Ocasional branches are also seen. The hyphae grow between the thick cell walls of 
algal thallus without penetratmg the wall or damaging the cell. Such a symbiotic 
association, called mycophycobiosis, is known to occur between some filamentous 

25 marine fungi and large marine algae (Donk and Bnming, 1992 - Ecology of aquatic 
fiingi in and on algae. In Reisser, W.(ed.): Algae and Symbioses: Plants, Animals, 
Fungi, Viruses, Interacdons Explored. Biopress Ltd., Bristol.) 

Examining the microphotograph in Figure 10, several differences between algal and 
30 fungal cells can be noticed. In contrast to several fim thick walls of the alga, the 
fungal walls are only 100-200 nm thick. Plant typical organells as chloroplasts with 
thyllacoid membranes as well as floridean starch grains can be seen in algal cells, but 
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not in the fungus. 

Intercellular connections of red algae are characterized by specific structures termed 
pit plugs, or pit connections The structures are prominent, electron dense cores and 
5 they are important features in algal taxonomy (Pueschel, CM.: An expanded survey 
of the ultrastructure of Red algal pit plugs. J. PhycoL 25, 625, (1989)). In our 
material, such connections were frequently observed in the algal thallus, but never 
between the cells of the fiingus. 

10 A.L2 in situ Hybridization experiments 

In situ hybridization technique is based upon the principle of hybridization of an 
antisense ribonucotide sequence to the mKNA. The technique is used to visualize 
areas in microscopic sections where said mRNA is present. In this particular case the 
IS technique is used to localize the enzyme Qf-l,4-glucan lyase in sections of 
Gracilariopsis lemaneiformis. 

A.L2.1 PrqKuaticm of ^^S labelled probes for In situ hybridization 

20 A 238 bp PGR fragment from a third PGR amplification - called clone 2 (see above) - 
was cloned into the pGEM-3Zf(+) Vector (Promega). The transcription of the 
antismse RNA was driven by the SP6 promoter, and the sense RNA by the T7 
promotor. The Ribonuclease protection assay kit (Ambion) was used with the 
following modifications. The transcripts were run on a 6% sequencing gel to remove 

25 the unincorporated nucleotide and eluted with the elution buffer supplied with the 
T7RNA polymerase in vitro Transcription Kit (Ambion). The antisense transcript 
contained 23 non-coding nucleotides while the sense contained 39. For hybridization 
10^ cpm/ml of the ^^S labelled probe was used. 

30 In situ hybridisation was performed essentially as described by Langedale 
et.al.(1988). The hybridization temperature was found to be optimal at 45**C. After 
washing at 45T the sections were covered with KodaK K-5 photographic emulsion 
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and left for 3 days at 5^C in dark (Rcf: Langedale, J. A., Rothermel, B.A. and 
Nelson, T. (1988). Genes and development 2: 106-115, Cold Spring Harbour Labora- 
tory), 

5 The in situ hybridization experiments with riboprobes against the mRNA of a-1,4- 
glucan lyase, show strong hybridizations over and around the hypha of the fungus 
detected in Gracilariopsis lenumeifamis - see Figures 4 and 5. This is considered 
a strong indication that the a-l,4-glucan lyase is produced. A weak random 
background reactions were detected in the algae tissue of both Gracilariopsis 
10 lemaneiformis. This reaction was observed both with the sense and the antisense 
probes. Intense staining over the fiingi hypha was only obtained with antisense 
probes. 

TTiese results were obtained with standard hybridisation conditions at 45^C in 
15 hybridization and washing steps. At 50°C no staining over the fungi was observed, 
whereas the background staining remained the same. Raising the temperature to 55**C 
reduced the background staining with both sense and antisense probes significantly 
and equally. 

20 Based upon the cytological investigations using complementary staining procedures 
it is concluded that Gracilariopsis lemaneiformis is fungus infected. The infections 
are most pronounced in the lower parts of the algal tissue. 

In sectioned Gracilariopsis lemaneiformis material in situ hybridization results clearly 
25 indicate that hybridization is restricted to areas where fungal infections are found - 
see Figure 4. The results indicate that a-l,4-glucan lyase mRNA appears to be 
restricted to fungus infected areas in Gracilariopsis lemaneiformis. Based upon these 
observations we conclude that Q-l,4-glucan lyase activity is detected in fungally 
infected Gracilariopsis lemaneiformis. 



30 
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ENZYME PURIFICATION AND CHARACTERIZATION 

Purification of a-l,4-glucan lyase from fungal infected Gracilariopsis lemaneiformis 
material was performed as follows. 

5 

A.2« 1 Materials and Methods 

The algae were harvested by filtration and washed with 0.9% NaCL The cells were 
broken by homogenization followed by sonication on ice for 6x3 min in SO mM 
10 citrate-NaOH pH 6.2 (Buffer A). Cell debris were removed by centrifugation at 
2S,000xg for 40 min. The supernatant obtained at this procedure was regarded as 
ceU-free extract and was used for activity staining and Western blotting after 
separation on 8-25% gradient gets. 

15 A.2.2 Separation by jS-cyclodextrin Sq)harose gel 

The cell-free extract was applied directly to a j3-cyclodextrin Sepharose gel 4B 
clolunm ( 2.6 x 18 cm) pre equilibrated with Buffer A. The column was washed 
with 3 volumes of Buffer A and 2 volumes of BvSfcx A containing 1 M NaCl. a-1 ,4- 
20 glucan lyase was eluted with 2 % dextrins in Buffer A. Active fractions were pooled 
and the buffer changed to 20 mM Bis-tris propane-HCl (pH 7.0, Buffer B). 

Active fractions were applied onto a Mono Q HR 5/5 column pre-equilibrated with 
Buffer B. The fungal lyase was eluted with Buffer B in a linear gradient of 0.3 M 
25 NaCL 

The lyase preparation obtained after /3-cyclodextrin Sepharose chromatography was 
alternatively concentrated to ISO fil and applied on a Superose 12 column operated 
under FPLC conditions. 

30 
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A. 2. 3 Assay for a-l,4-glucan lyase activity and conditions for determination of 
substrate specificity, pH and temperature optimum 

The reaction mixture for the assay of the a-1 ,4-glucan lyase activity contained 10 mg 
5 ml * amylopectin and 25 mM Mes-NaOH (pH 6.0). The reaction was carried out at 
30^C for 30 min and stopped by the addition of 3,5-dinitrosalicylic acid reagent. 
Optical density at SSOnm was measured after standing at room temperature for 10 
min* 

10 A3, AMINO ACID SEQUENCING OF THE <^1.4-GLUCAN LYASE FROM 
FUNGUS INFECTED GRACILARIOPSIS LEMANEIFORMIS 

A.3. 1 Amino acid sequencing of the lyases 

15 The lyases were digested with either endoproteinase Arg-C from Clostridium 
histolyricum or endoproteinase Lys-C from Lysobacter enzymogenes^ both sequencing 
grade purchased from Boehringer Mannheim, Germany. For digestion with 
endoproteinase Arg-C, freeze dried lyase (0,1 mg) was dissolved in 50 ftl 10 M urea, 
50 mM methylamine, 0.1 M Tris-HCl, pH 7.6. After overlay with Nj and addition 

20 of 10 of 50 mM DTT and 5 mM EDTA the protein was denatured and reduced for 
10 min at SOfC under N2. Subsequently, 1 /ig of endoproteinase Arg-C in 10 of 50 
mM Tiis-HCl, pH 8.0 was added, was overlayed and the digestion was carried out 
for 6h at 37*^. For subsequent cysteine derivatization, 12.5 ^1 100 mM iodoaceta- 
mide was added and the solution was incubated for IS min at RT in the dark under 

25 N2. 

For digesdon with endoproteinase Lys-C, freeze dried lyase (0.1 mg) was dissolved 
in SO fd of 8 M urea, 0.4 M NH4HCO3, pH 8.4. After overlay with and addition 
of 5 fil of 45 mM DTT, the protein was denatured and reduced for 15 min at 5QPC 
30 under Nj. After cooling to RT, 5 of 1(X) mM iodoacetamide was added for the 
cysteines to be deiivatized for 15 min at RT in the dark under Nj. 
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Subsequently, 90 /xl of water and 5 /xg of endoproteinase Lys-C in 50 /il of 50 mM 
tricine and 10 mM EDTA, pH 8.0, was added and the digestion was carried out for 
24h at 3TC under Nj. 

5 The resulting peptides were separated by reversed phase HPLC on a VYDAC CI 8 
column (0.46 x IS cm; 10 ^m; The Separations Group; California) using solvent A: 
0.1 % TFA in water and solvent B: 0. 1 % TFA in acetonitrile. Selected peptides were 
rechromatographed on a Develosil CIS column (0.46 x 10 cm; 3 iim; Dr. Ole Schou, 
Novo Nordisk, Dramark) using the same solvent system prior to sequencing on an 
10 Applied Biosystems 476A sequencer using pulsed-liquid fast cycles. 

The amino acid sequence information from the enzyme derived from fungus infected 
Gracilariopsis lemaneifomis is shown below, in particular SEQ. ID. No. 1. and 
SEQ. ID. No. 2. 

15 

gPQ. I,D, No- 1 h^; 
Number of residues : 1088. 

Amino acid composition (including the signal sequence) 



20 


61 Ala 


15 Cys 


19 His 


34 Met 


78Thr 




51 Arg 


42 Gin 


43 lie 


53Phe 


24 Tip 




88 Asn 


53 Glu 


63 Uu 


51 Pro 


58Tyr 




79 Asp 


100 Gly 


37Lys 


62 Ser 


77 Val 



25 SEQ, y.D, NPn 2 has: 

Number of residues : 109L 

Amino acid composition (including the signal sequence) 



58 Ala 


16 Cys 


14 His 


34 Met 


68Thr 


57 Arg 


40 Gin 


44 Ue 


S6Phe 


23Trp 


84 Asn 


47 Glu 


69 Leu 


51 Pro 


61 Tyr 


81 Asp 


102 Gly 


50Lys 


60 Ser 


76 Val 
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A.3.2 N TERMINAL ANALYSIS 

Studies showed that the N-terminal sequence of native giucan lyase 1 was blocked. 
Deblocking was achieved by treating giucan lyase 1 blotted onto a PVDF membrane 
5 with anhydrous TFA for 30 min at 4(f C essentially as described by LcGendre et al. 
(1993) [Purification of piotdns and peptides by SDS-PAGE; In: Matsudaira, P. (ed.) 
A practical guide to protein and peptide purification for microsequencing, 2nd edition; 
Academic Press Inc., San Diego; pp. 74-101.]- The sequence obtained was 
TALSDKQTA, which matches the sequence (sequence position from SI to 59 of 
10 SEQ. I.D. No.l) derived from the clone for giucan lyase 1 and indicates N- 
acetylthreonine as N-terminal residue of giucan lyase 1. Sequence position 1 to 50 
of SEQ. I.D. No. 1 represents a signal sequence. 

A.4. DNA SEQUENCING OF GENES CODING FOR THE iy-1.4^GLUCAN 
15 LYASE FROM FUNGUS INFECTED GRACILARIOPSIS LEMANEIFORMIS 

KAA METHODS FOR MOLECULAR BIOLOGY 

DNA was isolated as described by Saunders (1993) with the following modification: 
20 The polysaccharides were rraioved from the DNA by ELUTIP-d (Schleicher & 
Schuell) purification instead of gel purification. (ReftSaunders, G.W. (1993). Gel 
purification of red algal genomic DNA: An inexpensive and rapid method for the 
isolation of PCR-friendly DNA. Journal of phycology 29(2): 251-254 and Schleicher 
& Schuell: ELUTIP-d. Rapid Method for Purification and Concentration of DNA.) 

25 

A.4.2 PCR 

The preparation of the relevant DNA molecule was done by use of the Gene Amp 
DNA Amplification Kit (Pcrkin Elmer Cetus, USA) and in accordance with the 
30 manufactures instructions except that the Taq polymerase was added later (see PCR 
cycles) and the temperature cycling was changed to the following: 
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PCR cycles: 

no of cycles 



C 



time (mxn.) 



1 



98 



5 



60 



5 



addition of Taq polymerase and oil 



35 



94 



1 



47 



2 



72 



3 



1 



72 



20 



A.4.3 CLONING OF PCR FRAGMENTS 

PCR fragments were cloned into pTTBlue (from Novagen) following the instructions 
of the supplier. 

A,4.4 DNA SEQUENCING 

Double stranded DNA was sequenced essentially according to the dideoxy method of 
Sanger et al. (1979) using the Auto Read Sequencing Kit (Pharmacia) and the 
Pharmacia LKB A.L.F.DNA sequencer. (Ref: Sanger, F., Niclden, S. and Coulson, 
A.R.(1979). DNA sequencing with chain-determinating inhibitors. Proc. Natl. Acad. 
Sci. USA 74: 5463-5467.) 

Hie sequences are shown as SEQ.I.D. No.s 1 and 2. In brief: 

SEP, LP. No, 3 has: 
Total number of bases: 3267. 

DNA sequence composition: 850 A; 761 C; 871 G; 785 T 

SEP. I.D. No. 4 has: 
Total number of bases: 3276. 

DNA sequence composition: 889 A; 702 C; 856 G; 829 T 
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A.4.5 SCREENING OF THE LIBRARY 

Screening of the Lambda Zap library obtained from Stratagene, was performed in 
accordance with the manufacturer's instructions except that the prehybridization and 
5 hybridization was performed in 2xSSC, 0.1% SDS, lOxDenhardt's and lOO^g/ml 
denatured salmon sperm DNA. To the hybridization solution a 32P-labeled denatured 
probe was added. Hybridization was performed over night at SS^'C. The filters were 
washed twice in 2xSSC, 0.1 % SDS and twice in IxSSC, 0.1% SDS. 

10 A.4.6 PROBE 

The cloned PGR fragments were isolated from the pT7 blue vector by digestion with 
appropriate restriction enzymes. The fragments were seperated from the vector by 
agarose gel electrophoresis and the fragments were purified from the agarose by 
15 Agarase (Boehringer Mannheim). As the fragments were only 90-240 bp long the 
isolated fragments were exposed to a ligation reaction before labelling with 32P-dCTP 
using either Prime-It random primer kit (Stratagene) or Ready to Go DNA labelling 
kit (Pharmacia). 

20 A.4.7 RESULTS 

A.4.7.1 Generation of PGR DNA fragments coding for a-l,4-glucan lyase. 

The amino acid sequences of three overlapping tryptic peptides from a-l,4-glucan 
25 lyase were used to generate mixed oligonucleotides, which could be used as PGR 
primers (see the sequences given above). 

In the first PGR amplification primers A/B (see above) were used as upstream 
primers and primer C (see above) was used as downstream primer. The size of the 
30 expected PGR product was 71 base pairs. 
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In the second PGR amplification primers A/B were used as upstream primers and E 
was used as downstream primer. The size of the expected PGR product was 161 base 
pairs. 

5 In the third PGR amplification primers Fl (see above) and F2 (see above) were used 
as upstream primers and E was used as downstream primer. The size of the expected 
PGR product was 238 base pairs. 

The PGR products were analysed on a 2% LMT agarose gel and fragments of the 
10 expected sizes were cut out from the gel and treated with Agarase (Boehringer 
Manheim) and cloned into the pTTblue Vector (Novagen) and sequenced. 

The cloned fragments from the first and second PGR amplification coded for amino 
acids corresponding to the sequenced peptides (see above). The clone from the third 
IS amplification (see above) was only about 87% homologous to the sequenced peptides. 

A.4.7.2 Screening of the genomic library with the cloned PCR fragments. 

Screening of the library with the above-mentioned clones gave two clones. One clone 
20 contained the nucleotide sequence of SEQ I.D. No. 4 (gene 2). The other clone 
contained some of the sequence of SEQ I.D. No.3 (from base pair 1065 downwards) 
(gene 1). 

The S' end of SEQ. I.D. No. 3 (i.e. from base pair 1064 upwards) was obtained by 
25 the RACE (rapid amplification of cDNA ends) procedure (Michael, A.F., Michael, 
K.D. Sl Martin, G.R.(1988). Proc..Naa.Acad.Sci.USA 85:8998-99002.) using the 
5' race system from Gibco BRL. Total RNA was isolated according to Collinge et 
al.(Collinge, D.B., Milligan D.E:, Dow, J.M., Scofield, G.& Daniels, M.J.(1987). 
Plant Mol Biol 8: 405-414). The 5' race was done according to the protocol of the 
30 manufacturer, using l^g of total RNA. The PCR product from the second 
ammplification was cloned into pTTblue vector from Novagen according to the 
protocol of the manufacturer. Three independent PCR clones were sequenced to 



PCT/EP94/03397 



51 

compensate for PGR errors. 

An additional PGR was performed to supplement the clone just described with Xbal 
and Ndel restriction sites immediately in front of the ATG start codon using the 
5 following oligonucleotide as an upstream primer: 
GCTCTAGAGCAieTnTCAACCCTTGCG 

and a primer containing the complement sequence of bp 1S73-1S93 in sequence GLl 
(i.e. SEQ. I.D. No. 3) was used as a downstream primer. 

10 The complete sequence for gene 1 (i.e. SEQ. I.D. No. 3) was generated by cloning 
the 3' end of the gene as a BamHI-Hindlll fragment from the genomic done into the 
pBlucscript II KS-f vector from Stratagene and additionally cloning the PGR 
generated S' end of the gene as a Xbal-BamHI fragment in front of the 3' end. 

15 Gene 2 was cloned as a Hindlll blunt ended fragment into the EcoRV site of 
pBluescript II SK-f vector from Stratagene. A part of the 3* untranslated sequence 
was removed by a SacI digestion^ followed by religation. Hindlll and Hpal 
restriction sites were introduced immediately in front of the start ATG by digestion 
with Hindlll and Narl and religation in the presence of the following annealed 

20 oligonucleotides 

AGCTTGTTAACATGTATCCAACCCTCACCTTCGTGG 

ACAATTGTACATAGGTTGGGAGTGGAAGCACCGC 

25 No introns were found in the clones sequenced. 

The clone I type (SEQ.ID.No.3) can be aligned with all ten peptide sequences (see 
Figure 8) showing 100% identity. Alignment of the two protein sequences encoded 
by the genes isolated from the fungal infected algae Gracilariopsis lemaneiformis 
30 shows about 78% identity, indicating that both genes are coding for a a-1.4-glucan 
lyase. 
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A. 5. EXPRESSION OF THE GL GENE IN MICRO ORGANISMS 

(E-G. ANALYSES OF PICIIIA LYASE TRANSFORMANTS AND 

ASPERGILLUS LYASE TRANSFORMANTS) 

S The DNA sequence encoding the GL was introduced into microorganisms to produce 
an enzyme with high specific activity and in large quantities. 

In this regard, gene 1 (i.e. SEQ. LD« No. 3) was cloned as a Notl-Hindlll blunt 
ended (using the DNA blunting kit from Amersham International) fragment into the 
10 lochia expression vector pHIL-D2 (containing the AOXl promoter) digested with 
EcoRI and blunt ended (using the DNA blunting kit from Amersham International) 
for expression in Pichia pastom (according to the protocol stated in the Pichia 
Expression Kit supplied by Invitrogen). 

15 In another embodiment, the gene 1 (i,e. SEQ. LD. No. 3) was cloned as a Notl- 
Hindlll blunt ended fragment (using the DNA blunting kit from Amersham 
International) into the Aspergillus expression vector pBARMTEI (containing the 
methyl tryptophan resistance promoter from Neuropera crassa) digested with Smal 
for expression in Aspergillus rUger (Pall et al (1993) Fungal Genet Newslett. vol 40 

20 pages 59-62). The protoplasts were prepared according to Daboussi et al (Curr Genet 
(1989) vol 15 pp 453-456) using lysing enzymes Sigma L-2773 and the lyticase Sigma 
L-8012. The transformation of the protoplasts was followed according to the protocol 
stated by Buxton et al (Gene (1985) vol 37 pp 207-214) except that for plating the 
transformed protoplasts the protocol laid out in Punt et al (Methods in Enzymology 

25 (1992) vol 216 pp 447 - 457) was followed but with the use of 0.6% osmotic 
stabilised top agarose. 

The results showed that lyase activity was observed in the transformed Pichia pastoris 
and Aspergillus nigen 

30 
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A.5.1 GENERAL METHODS 

Preparation of cell-free extracts. 

5 The cells were harvested by centrifugation at 9000 ipm for 5 min and washed with 
0.9% NaCl and resuspended in the breaking buffer (SOniM K-phosphate, pH 7.5 
containing ImM of EDTA, and 5% glycerol). Cells were broken using glass beads 
and vortex treatment. The breaking buffer contained 1 mM PMSF (protease inhibi- 
tor). The lyase extract (supernatant) was obtained after centrifugation at 9000 rpm for 
10 5 min followed by centrifugation at 20»000 xg for 5min. 

Assay of lyase activity by alkaline 3,5-dinitrosaIicyIic acid reagent (DNS) 

One volume of lyase extract was mixed with an equal volume of 4% amylopectin 
15 solution. The reaction mixture was then incubated at a controlled temperature and 
samples vere removed at specified intervals and analyzed for AF. 

The lyase activity was also analyzed using a radioactive method. 

20 The reaction mixture contained 10 pA '^C-starch solution (1 fiCi; Sigma Chemicals 
Co.) and 10 ^1 of the lyase extract. The reaction mixture was left at 25*C overnight 
and was dien analyzed in the usual TLC system. The radioactive AF produced was 
detected using an Instant Imager (Pachaid Instrument Co., Inc., Meriden, CT). 

25 Electrophoresis and Western blotting 

SDS'PAGE was performed using 8-25% gradient gels and the PhastSystem 
(Pharmacia). Western blottings was also run on a Semidry transfer unit of the 
PhastSystem. 

30 

Primary antibodies raised against the lyase purified from the red seaweed collected 
at Qingdao (China) were used in a dilution of 1:100. Pig antirabbit IgG conjugated 
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to alkaline phosphatase (Dako A/S, Glostrup, Denmark) were used as secondary 
antibodies and used in a dilution of 1 : 1000, 

Part I, Analysis of the Pichia transformantscontaiiung the above mentioiied 
5 construct 

Results: 

1 . Lyase activity was detennined 5 days after induction (according to the manual) and 
10 proved the activity to be intraceUular for all samples in the B series. 

Samples of B series: 11 12 13 IS 26 27 28 29 30 



Specific activity: 139 81 122 192 151 253 199 198 150 

*Specific activity is defined as nmol AF released per min per mg protein in a reaction 
mixture containing 2% (w/v) of glycogen, 1% (w/v) glycerol in 10 mM potassium 
phosphate buffer (pH 7.5). The reaction temperature was 4S''C; the reaction time was 
60 min. 

20 

A time course of sample B27 is as follows. The data are also presented in Figure 1. 
Time (min) 0 10 20 30 40 50 60 



25 Spec. act. 0 18 54 90 147 179 253 



Assay conditions were as above except that die time was varied. 
2. Western-blotting analysis. 

30 

The CPE of all samples showed bands with a molecular weight corresponding to the 
native lyase. 
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MC-Lyase expressed intracellularly in Pichia pastoris 



Names of culture 


Specific activity* 


A18 


10 


A20 


32 


A21 


8 


A22 


8 


A24 


6 



Fart n, The AspergUus transf ormants 
Results 

20 

L Lyase activity was determined after S days incubation(ininiinaI medium containing 
0.2% casein enzymatic hydrolysate analysis by the alkaline 3,5-dinitrosalicylic acid 
reagent 

25 1). Lyase activity analysis of the culture medium 

Among 35 cultures grown with 0.2% amylopectin included in the culture medium, 
AF was only detectable in two cultures. The culture medium of 5.4-h and 5.9-f 
contained 0.13 g AF/liter and 0.44 g/liter, respectively. The result indicated that 
30 active lyase had been secreted from the cells. Lyase activity was also measurable in 
the cell-free extract. 
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2). Lyase activity analysis in cell-free extracts 



Name of the culture 


Specific activity* 


5.4+ 


51 


5.9+ 


148 


5.13 


99 


5.15 


25 


5.19 


37 



*The specific activity was defined as nmol of AF produced per min per mg protein 
at 25'*C» 4- indicates that 0.2% amylopectin was added. 

The results show that Gene 1 of GL was expressed intracellular in A. niger. 

20 

Experiments with transformed E.coli (using cloning vectors pQE30 from the Qia 
express vector kit from Qiagen) showed expression of enzyme that was recognised 
by anti-body to the enzyme purified from fungally infected Gmcilariopsis 
lemaneiformis. 



25 
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B, SOURCE = FUNGUS 

B.U ENZYME PURIFICATION AND CHARACTERIZATION OF TIIE cy-1.4- 
GLUCAN LYASE FROM THE FUNGUS MORCH ELLA COSTATA 

5 

B.1.1 Materials and Methods 

The fungus Morchella cosiota was obtained from American Type Culture Collection 
(ATCC). The fungus was grown at 25X on a shaker using the culture medium 
10 recommended by ATCC. The mycelia were harvested by filtration and washed with 
0.9% NaCl. 

The fungal cells were broken by homogenization followed by sonication on ice for 
6x3 min in 50 mM citrate-NaOH pH 6.2 (Buffer A). Cell debris were removed by 
15 centrifugation at 25,000xg for 40 min. The supernatant obtained at this procedure 
was regarded as cell-free extract and was used for activity staining and Western 
blotting after separation on 8-25% gradient gels. 

B.L2 Separation by jS-cyclodextrin Sepharose gel 

20 

The cell-free extract was applied directly to a i3-cyclodextrin Sepharose gel 4B 
clolunm ( 2.6 x 18 cm) pre equilibrated with Buffer A. The column was washed widi 
3 volumes of Buffer A and 2 volumes of Buffer A containing 1 M NaCl. a-l,4- 
glucan lyase was eluted with 2 % dextrins in Buffer A. Active fractions were pooled 
25 and the buffer changed to 20 mM Bis-tris propane-HCl (pH 7.0, Buffer B). 

Active fractions were applied onto a Mono Q HR 5/5 column pre-equilibrated with 
Buffer B. The fungal lyase was eluted with Buffer B in a linear gradient of 0.3 M 
NaCl. The lyase preparation obtained after /3-cyclodcxtrin Sepharose chromatography 
30 was alternatively concentrated to 150 fil and applied on a Superose 12 column 
operated under FPLC conditions. 
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B. 1.3 Assay for a-l,4-glucan lyase activity and conditions for determination of 
substrate specificity, pH and temperature optimum 

The reaction mixture for the assay of the a-l,4-glucan lyase activity contained 10 mg 
5 ml * amylopectin and 25 mM Mes-NaOH (pH 6,0). 

The reaction was carried out at 30 for 30 min and stopped by the addition of 3,5- 
dinitrosalicylic acid reagent. Optical density at 550nm was measured after standing 
at room temperature for 10 min. 10 mM EDTA was added to the assay mixture 
10 when cell-free extracts were used. 

The substrate amiylopectin in the assay mixture may be replaced with other substrates 
and the reacdon temperature may vary as spedfied in the text. 

15 In the pH optimum investigations, the reaction mixture contained amylopection or 
maltotetraose 10 mg ml"^ in a 40 mM buffer. The buffers used were glycine-NaOH 
(pH 2.0-3.5), HoAc-NaoAc (pH 3.5-5.5), Mes-NaOH (pH 5.5-6.7), Mops-NaOH 
(6.0-8,0) and bicine-NaOH (7.6-9.0). The reactions were carried out at 30^ for 30 
min. The reaction conditions in the temperature optimum investigations was the same 

20 as above except that the buffer Mops-NaOH (pH 6.0) was used in all exp^ments. 
The reacdon temperature was varied as indicated in the text. 

SDS-PAGE, Native-PAGE and isoelectrofocusing were performed on PhastSystem 
(Pharmacia, Sweden) using 8-25% gradient gels and gels with a pH gradient of 3-9, 
25 respectively. Following electrophoresis, the gels were stained by silver staining 
according to the procedures recommended by the manufacturer (Pharmacia). The 
glycoproteins were stained by PAS adapted to the PhastSystem. For activity staining, 
the electrophoresis was performed under native conditions at 6^C. 

30 Following the electrophoresis, the gel was incubated in the presence of 1 % soluble 
starch at 3(fC overnight. Activity band of the fungal lyase was revealed by staining 
with VKI solution. 
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B. 1.4 Results 

B. 1.4.1 Purification, molecular mass and isoelectric point of the a-l,4-glucan lyase 

5 The fungal lyase was found to adsorb on columns packed with i8-cyclodextrin 
Sepharose, starches and Red Sepharose. Columns packed with i3-cyclodextrin 
Sepharose 4B gel and starches were used for purification purposes. 

The lyase preparation obtained by this step contained only minor contaminating 
10 proteins having a molecular mass higher than the fungal lyase. The impurity was 
either removed by ion exchange chromatography on Mono Q HR 5/5 or more effici- 
ently by gel filtration on Superose 12. 

The purified enzyme appeared colourless and showed no absorbance in the visible 
IS light region. The molecular mass was determined to 110 kDa as estimated on SDS* 
PAGE. 

The purified fungal lyase showed a isoelectric point of pi 5.4 determined by 
isoelectric focusing on gels with a pH gradient of 3 to 9. In the native 
20 electrophoresis gels, the enzyme appeared as one single band. This band showed 
starch-degrading activity as detected by activity staining. Depending the age of the 
culture from which the enzyme is extracted, the enzyme on the native and isoelectric 
focusing gels showed either as a sharp band or a more diffused band with the same 
migration rate and pi. 

25 

B. 1.4.2 The pH and temperature optimum of the fungal lyase catalayzed reaction 

The pH optimum pH range for the fungal lyase catalyzed reaction was found to be 
between pH 5 and pH 7. 

30 
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B.i.4.3 Substrate specificity 

The purified fungal lyase degraded maltosaccharides from maltose to maltoheptaose. 
However, the degradation rates varied. The highest activity achieved was with 
5 maltotetraose (activity as 100%), followed by maltohexaose (97%), maltoheptaose 
(76%), maltotriose (56%) and the lowest activity was observed with maltose (2%). 

Amylopectin, amylose and glycogen were also degraded by the fungal lyase (% will 
be determined). The fungal lyase was an exo-lyase, not a endolyase as it degraded 
10 p-nitrophenyl a-D-maltoheptaose but failed to degrade reducing end blocked p- 
nitrophenyl a-D-maltoheptaose. 

BA.5 Morx:hella Vulgaris 

15 The protocols for the enzyme purification and charaterisation of alpha 1 ,4-glucaI lyase 
obtained from Morchella Vulgaris were the same as those above for Morchella 
Costata (with similar results). 

3.2. AMINO ACID SgpygNClNG OF THE « -L 4-Q ]UUCA N L YASE F^Q M 
20 FUNGUS 

B.2.1 Amino acid sequencing of the lyases 

The lyases were digested with either endoproteinase Arg-C from Clostridium 
25 histofyticum or endoproteinase Lys-C from Lysobacter emymogenes, both sequencing 
grade purchased from Boehringer Mamiheim, Germany. For digestion with 
endoproteinase Arg-C, fieezedried lyase (0.1 mg) was dissolved in 50 fd 10 M urea, 
50 mM methylamine, O.l M Tris-HCl, pH 7.6* After overlay with N2 and addition 
of 10 fil of SO mM DTT and 5 mM EDTA the protein was denatured and reduced for 
30 10 min at 5(fC under N2. Subsequentfy, 1 ftg of endoproteinase Arg-C in 10 of 50 
mM Tris-HCl, pH 8.0 was added, N2 was overlayed and the digestion was carried out 
for 6h at 37*^0. 
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For subsequent cysteine derivatization, 12.5 /xl 100 mM iodoacetamide was added and 
the solution was incubated for 15 min at RT in the dark under Nj. 

For digestion with endoproteinase Lys-C, freeze dried lyase (0.1 mg) was dissolved 
5 in 50 ftl of 8 M urea, 0.4 M >fH4HC03, pH 8,4. After overlay with and addition 
of 5 fil of 45 mM DTT, the protein was denatured and reduced for 15 min at 5QPC 
under Nj. After cooling to RT, 5 fil of lOO mM iodoacetamide was added for the 
cysteines to be derivatized for 15 min at RT in the dark under Nj* Subsequently, 90 
fil of water and 5 /xg of endoproteinase Lys^C in 50 fil of 50 mM tricine and 10 mM 
10 EDTA, pH 8.0» was added and the digestion was carried out for 24h at 37C under 

The resulting peptides were sq>arated by reversed phase HPLC on a VYDAC C18 
column (0.46 x 15 cm; 10 fim; The Separations Group; California) using solvent A: 
15 0. 1 % TFA in water and solvent B: 0. 1 % TFA in acetonitrile. Selected peptides were 
rechromatographed on a Develosil C18 column (0.46 x 10 cm; 3 ftm; Dr, Ole Schou, 
Novo Nordisk, Denmark) using the same solvent system prior to sequencing on an 
Applied Biosystems 476A sequencer using pulsed-liquid fast cycles. 

20 The amino acid sequence information from the enzyme derived from the fungus 
Morchetta costata is shown Fig, 17. 

The amino acid sequence information from the enzyme derived from the fungus 
Morchella vulgaris is shown Fig. 18. 

25 

B,3. DNA SEQUENCING OF GENES CODING FOR THE €k-1,4-GLUCAN 
LYASE FROM FPNQVS 

B.3.1 METHODS FOR MOLECULAR BIOLOGY 



DNA was isolated as described by Dellaporte et al (1983 - Plant Mol Biol Rep vol 
1 PP19-21). 
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B,3.2 PGR 

The preparation of the relevant DNA molecule was done by use of the Gene Amp 
DNA Amplification Kit (Perkin Elmer Cetus, USA) and in accordance with the 
manufactures instructions except that the Taq polymerase was added later (see PGR 
cycles) and the temperature cycling was changed to the following: 



PGR cycles: 

no of cycles C time (xninO 

1 98 5 

60 5 
addition of Taq polymerase and oil 

35 94 1 

47 2 

72 3 

1 72 20 



B.3.3 CLONING OF PGR FRAGMENTS 

PGR fragmmts were cloned into pTTBlue (from Novagen) following the instructions 
of the supplier. 

B.3.4 DNA SEQUENGING 

Double stranded DNA was sequenced essentially according to the dideoxy method of 
Sanger et al. (1979) using the Auto Read Sequencing Kit (Pharmacia) and the 
Pharmacia LKB A.L.F.DNA sequencer. (Ref: Sanger, P., Nicklen, and Coulson, 
A.R.(1979). DNA sequencing with chain-determinating inhibitors. Proc. Natl. Acad. 
Sci. USA 74: 5463-5467.) 
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B,3.5 SCREENING OF THE LIBRARIES 

Screening of the Lambda Zap libraries obtained from Stratagene, was performed in 
accordance with the manufacturer's instructions except that the prehybridization and 
5 hybridization was performed in 2xSSC, 0.1% SDS, lOxDenhardt's and 100/Ag/ml 
denatured salmon sperm DNA. 

To the hybridization solution a 32P-labeled denatured probe was added. Hybridization 
was performed over night at 55T. The filters were washed twice in 2xSSC, 0.1% 
10 SDS and twice in IxSSC, 0.1 % SDS. 

B.3.6 PROBE 

The cloned PCR fragments were isolated from the pT7 blue vector by digestion with 
15 ^propriate restriction enzymes. The fragments were seperated from the vector by 
agarose gcl electrophoresis and the fragments were purified from the agarose by 
Agarase (Boehringer Mannheim), As the fragments were only 90-240 bp long the 
isolated fragments were exposed to a ligation reaction before labelling with 32P-dCTP 
using either Prime-It random primer kit (Stratagene) or Ready to Go DNA labelling 
20 kit (Pharmacia). 

B.3.7 RESULTS 

B.3.7.1 Generation of PCR DNA fragments coding for a-l,4-glucan lyase. 

25 

The amino acid sequences (shown below) of three ov^lapping tryptic peptides from 
a-l,4-glucan lyase were used to generate mixed oligonucleotides, which could be 
used as PCR primers for amplification of DNA isolated from both MC and MV. 



30 
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Lys Asn Leu His Pro Gin His Lys Met Leu Lys Asp Thr Val Leu Asp lie Val Lys 
Pro Gly His Gly Glu Tyr Val Gly Trp Gly GIu Met Gly Gly He Gin Phe Met Lys 
GIu Pro Thr Phe Met Asn Tyr Phe Asn Phe Asp Asn Met Gin Tyr Gin Gin Val Tyr 
Ala Gin Gly Ala Leu Asp Ser Arg Glu Pro Leu Tyr His Ser Asp Pro Phe Tyr 

5 

In the first PCR amplification primers A1/A2 (see belov/) were used as upstream 
primers and primers B1/B2 (see below) were used as downstream primer. 

Primer Al: CA(GA)CA(CT)AA(GA)ATGCT(GATC)AA(GA)GA(CT)AC 
10 Primer A2: CA(GA)CA(CT)AA(GA)ATGTT(GA)AA(GA)GA(CT)AC 
Primer Bl: TA(GA)AA(GATC)GG(GA)TC(GA)CT(GA)TG(GA)TA 
Primer B2: TA(GA)AA(GATC)GG(GA)TC(GATC)GA(GA)TG(GA)TA 

The PCR products were analysed on a 2% LMT agarose gel and fragments of the 
15 expected sizes were cut out from the gel and treated with Agarase (Boehringer 
Manheim) and cloned into the pTTblue Vector (Novagen) and sequenced. 

The cloned fragments from the PCR amplification coded for amino acids 
corresponding to the sequenced peptides (see above) and in each case in addition to 
20 two intron sequences. For MC the PCR amplified DNA sequence corresponds to the 
sequence shown as from position 1202 to position 1S22 with reference to Figure 14. 
For MY the PCR amplified DNA sequence corre!qx>nds to the sequence shown as 
from position 1218 to position 1S3S with reference to Figure 15. 

25 B.3.7.2 Screening of the genomic libraries with the cloned PCR fragments. 

Screening of the libraries with the above-mentioned clone gave two clones for each 
source. For MC the two clones were combined to form the sequence shown in 
Figure 14 (see below). For MV the two clones could be combined to form the 
30 sequence shown in Figure 15 in the manner described above. 
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An additional PGR was performed to supplement the MC clone with PstI, Pvull, AscI 
and Ncol restriction sites immediately in front of the ATG start codon using the 
following oligonucleotide as an upstream primer: 
AAACTGCAGCTGGCGCGCCATGGCAGGATnrCTGAT 
5 and a primer containing the complement sequence of bp 1297-1318 in Figure 4 was 
used as a downstream primer. 

The complete sequence for MC was generated by cloning the 5' end of the gene as 
a Bglll-EcoRI fragment from one of the genomic clone (first clone) into the BamHI- 

10 EcoRI sites of pBluescript II KS+ vector from Stratagene. The 3' end of the gene 
was then cloned into the modified pBluescript II KS+ vector by ligating an NspV 
(blunt ended, using the DNA blunting kit from Amersham Intemational)-EcoRI 
fragment from the other genomic clone (second clone) after the modified pBluescript 
n KS + vector had been digested with EcoRI and EcoRV, Then the intermediate part 

15 of the gene was cloned in to the further modified pBluescript II KS+ vector as an 
EcoRI fragment from the first clone by ligating that fragment into the further 
modified pBluescript II KS+ vector digested with EcoRI. 

B.4, EXPRESSIO N OF THE GL GENE IN MICRO-ORGANISMS 

20 

The DNA sequence encoding the GL can be introduced into microorganisms to 
produce the enzyme with high specific activity and in large quantities. 

In this rpgard, the MC gene (Figure 14) was cloned as a Xbal-Xhol blunt ended 
25 (using the DNA blunting kit from Amersham International) fragment into the Pichia 
expression vector pHIL-D2 (containing the AOXl promoter) digested with EcoRI and 
blunt ended (using the DNA blunting kit from Amersham International) for expression 
in Pichia pasioris (according to the protocol stated in the Kchia Expression Kit 
supplied by Invitrogen). 
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In another embodiment, the MC gene 1 (same as Figure 14 except that it was 
modified by PCR to introduce restriction sites as described above) was cloned as a 
Pvull-Xhol blunt ended fragment (using the DNA blunting kit from Amersham 
International) into the Aspergillus expression vector pBARMTEl (containing the 
5 methyl tryptophan resistance promoter from Neuropera crassa) digested with Smal 
for expression in Aspergillus niger (Pall et al (1993) Fungal Genet Newslett. vol 40 
pages 59-62). The protoplasts were prepared according to Daboussi et al (Curr Genet 
(1989) vol 15 pp 453-456) using lysing enzymes Sigma L-2773 and the lyticase Sigma 
1^8012. The transformation of the protoplasts was followed according to the protocol 
10 stated by Buxton et al (Gene (1985) vol 37 pp 207-214) except that for plating the 
transformed protoplasts the protocol laid out in Punt et al (Methods in Enzymology 
(1992) vol 216 pp 447 - 457) was followed but with the use of 0.6% osmotic 
stabilised top agarose. 

15 The results showed that lyase activity was observed in the transformed Kchia pasioris 
and Aspergillus niger 

ANALYSES OF PICHIA LYASE TRANSFORMANTS AND ASPERGILLUS 
LYASE TRANSFORMANTS 

20 

GENERAL METHODS 
Prq>aration of cell-free extracts. 

25 The cells were harvested by centrifugation at 9000 rpm for 5 min and washed with 
0.9% NaCl and resuspended in the breaking buffer (50mM K-phosphate, pH 7.5 
containing ImM of EDTA, and 5% glycerol). Cells were broken using glass beads 
and vortex treatment. The breaking buffer contained 1 mM PMSF (protease inhibi- 
tor). The lyase extract (supernatant) was obtained after centrifugation at 9000 rpm for 

30 5 min followed by centrifugation at 20,0(X) xg for 5min. 
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Assay of lyase activity by alkaline 3,5-dinitrosaIicylic acid reagent (DNS) 

One volume of lyase extract was mixed with an equal volume of 4% amylopectin 
solution. The reaction mixture was then incubated at a controlled temperature and 
5 samples were removed at specifled intervals and analyzed for AP. 

The lyase activity was also analyzed using a radioactive method. 

The reaction mixture contained 10 /d '^C-starch solution (1 nCi; Sigma Chemicals 
10 Co.) and 10 fil of the lyase extract. The reaction mixture was left at 25''C overnight 
and was then analyzed in the usual TLC system. The radioactive AF produced was 
detected using an Instant Imager (Pachard Instrument Co., Inc., Meriden, CT). 

Electrophoresis and Western blotting 

15 

SDS-PAGE was performed using 8-25% gradient gels and the PhastSystem 
(Pharmacia). Western blottings was also run on a Semidry transfer unit of the 
PhastSystem. Primary antibodies raised against the lyase piurified from the red 
seaweed collected at Qlngdao (China) were used in a dilution of 1 : 100. Pig antirabbit 
20 IgG conjugated to alkaline phosphatase (Dako A/S, Glostrup, Denmark) were used 
as secondary antibodies and used in a dilution of 1: 1000. 
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Part Analysis of the Pichia traiisfonnant^containing the above mentioned 
construct 



5 

MC-Lyase expressed intracellularly in Pichia pastoris 



Names of culture 


Specific activity* 


A18 


10 


A20 


32 


A21 


8 


A22 


8 


A24 


6 



"The specific activity was defined as nmol of AF produced per min per mg protein 
at 25''C. 
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Part II, The Aspergilus transfonmants 
Results 

S h Lyase activity was determined after 5 days incubation(minimaI medium containing 
0.2% casein enzymatic hydrolysate analysis by the alkaline 3,S-dinitrosalicylic acid 



10 



reagent 




Lyase activity analysis 


1 in cell-firee extracts 


Name of the culture 


Spedfic activity* 


8.13 


11 


8.16 


538 


8.19 


37 



"The spedfic activity was defined as nmol of AF produced per min per mg protein 
20 at 25*^:. 

The results show that the MC-lyase was expressed intracellular in A. fiiger. 
U. Lyase activity test by radioactive method 

25 

The cell-free extracts of the following cultures contained '''C labelled AF 

51 + , 54+, 55+, 59+, 512, 513, 514, 515, 516, 518, 519. 
The TLC of the degradation products of the a-l,4-glucan lyase reaction using ^'^C- 
30 starch as substrate are shown in Figure 20. The reaction mixture was applied on the 
TLC, The lane number corresponds to the name of the culture: 1, 512; 2, 513; 3, 
514; 4, 515; 5, 516; 6, 517; 7, 518; 8, 519; 9, 520. The fast moving spots are AF. 
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C. SOURCE = ALGAE ALONE 

The protocolis for the enzyme purification and charaterisation of alpha 1 ,4-glucaI lyase 
obtained from Gracilarioposis lemaneifomis (as obtained from Santa Cruz) were 
5 essentially the same as those described above for, for exaniple, Morchella Costata 
(with similar results). 

1. Characterization of a-l,4-glucan lyase from the parasite-free red seaweed 
Gracilariopsis lemanelfonms collected in California. 

10 

Hie amino acid composition of the lyase is given in the following table. 
Amino acid residues mol % of each residue 





Asx 


15.42 


15 


Thr 


5.24 




Ser 


6.85 




Glx 


9.46 




Pro 


5.46 




Gly 


9.08 


20 


Ala 


5.38 




l/2Cys 


1.57 




Val 


6.60 




Met 


2.90 




Ue 


3.66 


25 


Leu 


6.00 




Tyr 


6.00 




Phe 


4.37 




His 


1.65 




Lys 


4.44 


30 


Arg 


4.17 




Trp 


1.75 




Total: 


100.00 



WO9i$/10616 



PCT/EP94/03397 



71 

2. SEQUENCE ANALYSIS 

Comparison of the peptide sequences from the Califomian algae with the amino acid 
sequence from the fungally infected algae from China showed a high degree of 
5 homology (78 to 80% identity between the amino acid sequence generated from the 
PCR fragments and the corresponding sequences in the GL obtained from the algae 
from China) between the two protein sequences. 

Three Oligonucleotides was generated from these two sequences from the Califomian 
10 aigae to generate a PCR fragmmt of app. 970 bp. 

Primer 1: ATGAC(GATQAA(CT)TA(CT)AA(CT)TA(CT)GA(CT)AA 
Primer 2: (AG)TG(GATC)GGCATCAT(GATC)GC(GATC)GG(GATC)AC 
Primer 3: GTCAT(GA)TC(CT)TGCCA(GATC)AC(GA)AA(GA)TC 

15 

Primer I was used as the upstream primer and primer 2 was used as the downstream 
primer in the first PCR amplification. In the second PCR amplification primer 1 was 
used as the upstream primer and primer 3 was used as the downstream primer. A 
PCR fragment of the expected size was generated and cloned into the pTTbluc vector 
20 from Novagen. Three indqiendent plasmids containing a PCR fragment were 
sequenced and it was seen that these three cloned PCR figments contained the 
codons for peptide sequences originating from three different proteins. This indicates 
that there are at least three different genes coding for (x-l,4*glucan lyase in the 
Califomian algae. 

25 

3. The substrate concentration at which half of the maximal velocity rate was reached 
is 3.76 mg/ml for amylopectin and 3.37 mg/ml for glycogen. 
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4. The degradation rates of the lyase on various substrates are given below. 



Substrate 


AF released (nmol) 


Maltose 


657 


Maltotriose 


654 


Maltotetiaose 


670 


Maltopentaose 


674 


Maltohexaose 


826 


Maltoheptaose 


865 


Dextrin 20 


775 


Dextrin IS 


775 


Dextrin 10 


844 


Amylopectin 


732 


Glycogen 


592 



30 



Reaction conditions: The reaction mixture contained 10 mM of HOAc-NaOAc (pH 
3.8)« The substrate concentration was 10 mg/ml. The final volume was 100 ul after 
the addition of lyase and water. The reaction time was 40 min at 45%^. 
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The lyase was not able to degrade puUulan, nigeran tetrasaccharide, trehalose, 
isomaltose, glucose, a-, 6- and r-cyclodextrins. The lyase degraded panose and 
nigerose though at a slow rate. 

5 5. The temperature optimum for the lyase was 48^C when amylopectin was used as 
substrate and SOPC when glycogen was used as substrate. At 50°C, the reactivity of 
glycogen was similar to that of amylopectin; below SOPC, amylopectin was a better 
substrate than glycogen. 

10 6« The pH optimum range for the lyase was between pH 3.5 and pH 7,0; the optimal 
pH was 3.8. The buffers used in the pH tests were glycine-HCl (pH 2.2-3.6); 
NaOAc-HOAc (pH 3.5-5.5); Mes-NaOH (pH 5.5-6.7); Mops-NaOH (pH 6.0-8.0) and 
bicine-NaOH (pH 7.6-9.0). All buffers used were 40 mM. 

15 7. At a final concentration of 2 mM, p-chloromercuribenzoic acid (PCMB) inhibited 
the lyase activity by 96% , indicating the -SH gToup(s) is essential for the enzymatic 
activity. 

7. FURTHER STUDIES 

20 

7.1 Effect of alcohols in increasing the activity and stability of the lyase purified 
from the fungal infected algae. 

1-propanoI, 2-propanoI and I-butanol were tested at the following concentrations (0% , 
25 I %, 5 % and 10%). The optimal concmtration of 1-propanoi was 5% which increased 
the AF yield by 34% after 6 days of incubation; the optimal concentration for 2- 
propanol was 1 % which increased the AF yield by 20% after 10 days incubation; the 
optimal concentration for 1-butanol was 5% which increased the AF yield by 52% 
after 3-day incubation. 

30 

Ethanol was tested at the following concentrations (0, 1, 3, 5, 7, 9, 11, 13, 15%). 
The optimal concentration for 7 days incubation was 5% which increased the AF 
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yield by 12%. For 10 days incubation the optimal concentration was 3% which in- 
creased AF yield by 16%. 

The effect of 1-propanol: 

5 

I^ropanol Reaction time (days) 



conceatracuon 












(v/v) 


0 


1 


3 


6 


10 






AF yield (pmol) 






0% 


0 


S4 


261 


451 


689 


1% 


0 


80 


2S0 


530 


803 


5% 


0 


lis 


367 


605 


853 


10% 


0 


107 


307 


456 


583 



15 

7.2 Effect of different reaction media upon the production of AF by the lyase purified 
from the fungal infected algae and the fugnal lyase from M. costota and Af. vulgaris. 

2.1. The lyase from the fungal infected algae. 

20 

The results (see table below) indicate that the best reaction medium is 5 mM of 
HOAc-NaOAc (pH 3.9) (BACE for short) and containing mM concentrations of Naj- 
EDTA. The production of AF using either pure water or 0.85% NaCl as reaction 
medium decreased the yield. Inclusion of 0.85% of NaCI in BACE also decreased the 
25 AF yield. 



Reaction Reaction Time (days) 
Media 



30 0 1 3 8 

AF yield (^mol) 

BACE 0 229 498 575 

Water 0 46 128 217 

NaCl (0.85%) 0 123 239 249 

35 BACE+NaCi (0.85%) 0 153 281 303 
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2.2. The following buffers: Mes-NaOH, Mops-NaOH, Hepes-NaOH, and Bicine- 
NaOH were the optimal reaction media for the lyase from M, comta and -Af. 
vulgaris. In the HOAc-NaOAc buffer, the lyase was unstable and therefore use of this 
buffer system caused a decrease in AF yield. 

5 

7.3. The effect of endoamylases and debranching enzymes upon the AF 
production. 

3.1. The effect of endoamylase 

10 

The starch used for AF producdon may first be liquified either by endoamylases, or 
by add hydrolysis. 

Endoamylase degraded starch is more suitable as substrate for the lyase as compared 
IS to native starch. Starch has a limited solubility at the temperature used for the lyase- 
catalyzed reaction. Treatmoit of starch with mdoamyiases led to increased glucose 
yied. It was found that a reducing matter of around 10-15% (on a dry mater basis) 
was most suitable as substrate for the lyase with respect to AF yield and further 
treatment with the endoamylase to a reducing matter of 19% was no longer suitable 
20 for the lyase. 

3.2. The effect of piilhilaiiase and isoamylase 

As seen from the results below, both the isoamylase and the puUulanase increased AF 
25 yield by up to 50% at pH 4.S and 5.0. The reaction system consisted of the lyase 
from the fungal affected red algae with or mthout the addition of isoamylase or 
puUulanase (MegaZyme Ltd.). Amylqpectin was used as substrate. The AF produced 
in the presence of only the lyase was expressed as 100%. 
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The pH of the reaction medium 



5 


Enzymes added 


3.5 


4.5 


5.0 




Lyase only 


100 


100 


100 




Lyase + isoamylase 


136 


152 


150 


10 


Lyase + pullulanase 


132 


158 


155 



4. Hie relative degrading rates of the fungal lyase towards various substrates 

15 4.1. The lyase from M. costata. 

The activity obsoved with maltotetiaose is expressed as 100%. 



Substrate concentration 2mg/ml 4nig/ml lOmg/ml 

Maltose 0.5 1.6 2.2 

20 Maltotriose 40.6 58.6 56.0 

Maltotetiaose 100 100 100 

Maltopentaose 107.1 100.1 99.7 

Maltohexaose 86.6 98.2 95.9 

Maltoheptaose 82.2 81.5 75.7 

25 Dextrin 10* -** - 68.3 

Dextrin 15* - - 61.1 

Dextrin 20* - - 46.6 

Soluble Stardi - - 92.9 

Amylopectin - - 106.5 

30 glycogen - - 128.5 

* the number indicates the contents of the reducing matter on a dry weight basis. **, 

not dctmnined. 
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4J1. The lyase from M. vulgaris lyase. 

The activity observed for maltotetraosc is treated as 100%. The final concentration 
of all substrates was 10 mg ml ^ 







Substrates 


Activity (%) 


Maltose 


10.1 


Maltotriose 


49.8 


Maltotetiaose 


100.0 


Maltopentaose 


79.3 


Maltohexaose 


92.4 


Maltohqitaose 


73.9 


D«trin 10 


62 


Dextrin 15 


45 


Dextrin 20 


37 


Soluble starch 


100.5 


Amylopectin 


139.9 


Glycogen 


183.3 
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Hie lyase from M, costata and M, vulgaris was unable to degrade the following 
sugars. 

Trehalose, panose, nigerose, nigerotetraose, glucose, isomaltose, alpha-, beta and 
5 gama-cyclodextrins, pullulalans and non-reducing end blocked p-nitrophenyl a-D- 
maltoheptaoside as there was no AF detectable on a TLC plates after these substrates 
had been incubated for 48 h with the fungal lyase. 

7.5. pH and temperature optimum for the lyase catalyzed reaction. 

10 



GL sources Optimal pH Optimal pH range Optimal temperature 
M. cosma 6.5 5.5-7.5 37 C; 40 C* 

M. vulgaris 6.4 5.9-7.6 43 C; 48 C- 



Fungal infected Gracilariopsis 

lemaneiformis 3.8 3.7-4.1 40 C; 45 C* 

"Parameters determined using glycogen as substrate; other parameters determined 
using amylopectin as substrate. 

7*6. The stabiliadng effect of glycogen on the lyase from the fungal infected 
25 Gracilariopsis lemaneiformis. 

The results indicate that at higher temperatures the reaction rates were higher when 
glycogen was used as substrate instead of amylopectin. 
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Reaction temperature 

Substrates 25 C 30 C 45 C 

5 Amylopectin 0.818* L133' LHr 

Glycogen 0.592* 0.904* 1.861* 

The ratio of relative reaction rates between Glycogen and Amylopectin (%) 
72.4 79.8 158.9 

10 % the relative reaction rates. 

7.7. The molecular masses and pi values of the lyases 

The molecular masses of the lyases from the fungal infected G. lemaneiforniis^ both 
IS forms of lyase from q[>paient fungal free G. lemaneiformis, from Af. cosutta and M. 
vulgaris were estimated to 1 10,000 ± 10,000 daltons usind SDS-PAGE on a gradient 
gel (8-25%). 

The pi of the lyase from the fiingal infected G. lemaneiformis was around 3.9. For 
20 the lyase from M. vuglaris, the pi was around pH 4.6 and the pi for the lyase from 
Af. costata was around 5.0. These values were obtained by isoelectric focusing on a 
gel with a pH gradient from 3 to 9. 

The pi values deduced from the amino acid compositions are: 
25 The lyase from the fungal infected G. lemaneiformis: 4.58 and for the lyase from A/, 
costata: 6.30. 

7.8. Immunological test of the lyase by Western blotting. 

30 The results showed that the antibodies to the algal lyase could recognize the fungal 
lyase both in cell-free extracts and in purified form, as revealed by Western blottings. 
The antibodies to the algal lyase purified form the algae collected from China also 
recognized the lyase from the algae collected from Sant Cruz, Califonia. 
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GL sources 



Reactivity with the antibodies against the GL 
from the fiiogal infected G. lemaneifomus 



Fungal infected G. lemaneifomus 



Strong 




both form of GL 



Strong 



M. cosrara 



medium 



M. vulgaris 



medium 



7.9. Reversible and Irreversible Inhibitors of the fungal lyase 
9 A. The reversible inhibitorsi Glucose and Maltose. 

At a substrate concentration of lOmg/ml, the activity for the M, costata lyase decrea- 
sed by 19.3 % in the presence of 0.1 M glucose when araylopectin was used as 
substrate; the activity was not affected when glycogen was used as substrate. In the 
presence of 0.1 M of maltose the activity decreased by 48.8 % and 73.4%, respec- 
tively for glycogen and amylopectin. 



Substrates 


Inhibitors 




Concentrations 


Glucose 


Maltose 


Amylopectin 1% (2%) 


19.3 %f7%) 


73.4% (67.2%) 


Glycogen 1% (2%) 


0.000 (-) 


48.8% (49.7%) 



It seems that the inhibition by 0. 1 M glucose is competitive as increasing the substrate 
from 1 % to 2% decreased the inhibition from 19.3 to 7%, whereas the inhibition by 
0. 1 M maltose is non-competitive as the increase of substrate did not significantly 
affect the inhibition degree. 
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For the M. vulgaris lyase, 0. 1 M glucose and maltose did also inhibit the reaction 
when either amylopcx:tin or glycogen was used as substrate. 



Substrates Glucose Maltose 

5 

Amylopectin (1%) 28% 80% 



Glycogen (1%) 5% 57% 

10 

9,2. Hie reversible inhibitor deoxyjirimycia 

At a final substrate concentration of 2%, the activity was decreased to 10.4% for the 
algal lyase and the M.costata lyase in the presence of 25 of deoxyjirimycin, using 
IS amylopectin as substrate. At 100 /iM, the activity of both lyases was completely lost. 

9.3« Irreversible Inhibitor: PCMB 

Under the same assay conditions and in the presence of 2 mM PCMB, the activity 
20 decreased by 60% for the Af. castata lyase and 98 % for the lyase from the fungal 
infected red algae. This means that the fungal lyase was much less sensitive to heavy 
metal inhibition. 

7.10. Examples of laboratory scale production of AF 

25 

10.1. Production of AF using dextrin as substrate 

The reactor contained 1000 g dextrins (obtained by treatment of starch with 
TCTmamyl to a final reducing matter of 10 %) in a final volume of 4.6 liter (HOAC- 
30 NaOAC, pH 3.9, containing 5 mM Naj-EDTA). The reaction was initiated by adding 
3 mg lyase purified from fungal infected algae. The reaction was performed at room 
temperature. At day 19, another batch of lyase (4 mg) was added. 
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Reaction time (days) 

0 1 7 13 19 24 31 



5 AF produced (grams) 

0 18 116 195 264 500 668 

10.2. Using "C-Starch for the production of "C-AF 

10 

The uniformly labelled "C-starch (340 ftCi obtained from Sigma) was vaccum-dried 
to remove the cthanol it contained and then dissolved in 2 ml water. The reaction was 
initiated by adding 20 iya^ purified from the fungal infected algae and 20 id 
pullulanase (MegaZyme Ltd.) The reaction was p^ormed overnight at 30 ""C. At the 
IS end of the reaction, the reaction mixture was filtered using a filter with a molecular 
mass cut off of 10,000 to remove the enzymes and unreacted starch molecules. 

The filtrate was applied on a Caj carbohydrate colunm (Chrompack) using a Waters 
HPLC. Water was used as eluent. The flow rate was 0.5 ml/min. AF was efficiently 
20 separated from glucose and maltosaccharides. The pooled AF fractions were freeze- 
dried and totally 140 fiCi '^C-AF was obtained. 

These fmdings relate to an even further aspect of the present invention, namely the 
use of a reagent that can increase the hydrophobicity of the reaction medium 
25 (preferably an alcohol) to increase the stability and activity of the lyase according to 
the present invention. This increased stability leads to a increased AF yield. 

Other modifications of the present invention will be apparent to those sldlled in the 
art without departing from the scope of the invention. 

30 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: OANISCO A/S 

(B) STREET: LANGEBROGADE 1 

(C) CITY: COPENHAGEN 

(D) STATE: COPENHAGEN K 

(E) COUNTRY: DENMARK 

(F) POSTAL CODE (ZIP) : DK-IOOl 

(ii) TITLE OF INVENTION: USE OF AN ENZYME 
(111) NUMBER OF SEQUENCES: 39 

(1v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: -PC-OOS/HS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/EP94/03397 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1088 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: I: 

Met Phe Ser Thr Leu Ala Phe Val Ala Pro Ser Ala Leu Gly Ala Ser 
15 10 15 

Thr Phe Val Gly Ala Glu Val Arg Ser Asn Val Arg He His Ser Ala 
20 25 30 

Phe Pro Ala Val His Thr Ala Thr Arg Lys Thr Asn Arg Leu Asn Val 
35 40 45 

Ser Met Thr Ala Leu Ser Asp Lys Gin Thr Ala Thr Ala Gly Ser Thr 
50 55 60 

Asp Asn Pro Asp Gly He Asp Tyr Lys Thr Tyr Asp Tyr Val Gly Val 
65 70 75 80 

Trp Gly Phe Ser Pro Leu Ser Asn Thr Asn Trp Phe Ala Ala Gly Ser 
85 90 95 
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Ser Thr Pro 61y Gly He Thr Asp Trp Thr Ala Thr Met Asn Val Asn 
100 105 110 

Phe Asp Arg He Asp Asn Pro Ser He Thr Val Gin His Pro Val Gin 
115 120 125 

Val 61 n Val Thr Ser Tyr Asn Asn Asn Ser Tyr Arg Val Arg Phe Asn 
130 135 140 

Pro Asp Gly Pro lie Arg Asp Val Thr Arg Gly Pro He Leu Lys Gin 
145 150 155 160 

61n Leu Asp Trp He Arg Thr Gin Glu Leu Ser Glu Gly Cys Asp Pro 
165 170 175 

Gly Met Thr Phe Thr Ser Glu Gly Phe Leu Thr Phe Glu Thr Lys Asp 
180 185 190 

Leu Ser Val He He Tyr Gly Asn Phe Lys Thr Arg Val Thr Arg Lys 
195 200 205 

Ser Asp Gly Lys Val He Met Glu Asn Asp Glu Val Gly Thr Ala Ser 
210 215 220 

Ser Gly Asn Lys Cys Arg Gly Leu Met Phe Val Asp Arg Leu Tyr Gly 
225 230 235 240 

Asn Ala lie Ala Ser Val Asn Lys Asn Phe Arg Asn Asp Ala Val Lys 
245 250 255 

Gin Glu Gly Phe Tyr Gly Ala Gly Glu Val Asn Cys Lys Tyr Gin Asp 
260 265 270 

Thr Tyr He Leu Glu Arg Thr Gly He Ala Met Thr Asn Tyr Asn Tyr 

275 280 285 

Asp Asn Leu Asn Tyr Asn Gin Trp Asp Leu Arg Pro Pro His His Asp 
290 295 300 

Gly Ala Leu Asn Pro Asp Tyr Tyr He Pro Met Tyr Tyr Ala Ala Pro 
305 310 315 320 

Trp Leu lie Val Asn Gly Cys Ala Gly Thr Ser Glu Gin Tyr Ser Tyr 
325 330 335 

Gly Trp Phe Met Asp Asn Val Ser Gin Ser Tyr Met Asn Thr Gly Asp 
340 345 350 

Thr Thr Trp Asn Ser Gly Gin Glu Asp Leu Ala Tyr Met Gly Ala Gin 
355 360 365 

Tyr Gly Pro Phe Asp Gin His Phe Val Tyr Gly Ala Gly Gly Gly Met 
370 375 380 

Glu Cys Val Val Thr Ala Phe Ser Leu Leu Gin Gly Lys Glu Phe Glu 
385 390 395 400 
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Asn Gin Val Leu Asn Lys Arg Ser Val Met Pro Pro Lys Tyr Val Phe 
405 410 415 

Gly Phe Phe Gin Gly Val Phe Gly Thr Ser Ser Leu Leu Arg Ala His 
420 425 430 

Met Pro Ala Gly Glu Asn Asn He Ser Val Glu Glu lie Val Glu Gly 
435 440 445 

Tyr Gin Asn Asn Asn Phe Pro Phe Glu Gly Leu Ala Val Asp Val Asp 
450 455 460 

Met Gin Asp Asn Leu Arg Val Phe Thr Thr Lys Gly Glu Phe Trp Thr 
465 470 475 480 

Ala Asn Arg Val Gly Thr Gly Gly Asp Pro Asn Asn Arg Ser Val Phe 
485 490 495 

Glu Trp Ala His Asp Lys Gly Leu Val Cys Gin Thr Asn He Thr Cys 
500 505 510 

Phe Leu Arg Asn Asp Asn Glu Gly Gin Asp Tyr Glu Val Asn Gin Thr 
515 520 525 

Leu Arg Glu Arg Gin Leu Tyr Thr Lys Asn Asp Ser Leu Thr Gly Thr 
530 535 540 

Asp Phe Gly Met Thr Asp Asp Gly Pro Ser Asp Ala Tyr He Gly His 
545 550 555 560 

Leu Asp Tyr Gly Gly Gly Val Glu Cys Asp Ala Leu Phe Pro Asp Trp 
565 570 575 

Gly Arg Pro Asp Val Ala Glu Trp Trp Gly Asn Asn Tyr Lys Lys Leu 
580 585 590 

Phe Ser He Gly Leu Asp Phe Val Trp Gin Asp Met Thr Val Pro Ala 
595 600 605 

Met Met Pro His Lys He Gly Asp Asp He Asn Val Lys Pro Asp Gly 
610 615 620 

Asn Trp Pro Asn Ala Asp Asp Pro Ser Asn Gly Gin Tyr Asn Trp Lys 
625 630 635 640 

Thr Tyr His Pro Gin Val Leu Val Thr Asp Met Arg Tyr Glu Asn His 
645 650 655 

Gly Arg Glu Pro Met Val Thr Gin Arg Asn He His Ala Tyr Thr Leu 
660 665 670 

Cys Glu Ser Thr Arg Lys Glu Gly He Val Glu Asn Ala Asp Thr Leu 
675 680 585 

Thr Lys Phe Arg Arg Ser Tyr He He Ser Arg Gly Gly Tyr lie Gly 
690 695 700 
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Asn Gin His Phe Gly Gly Met Trp Val Gly Asp Asn Ser Thr Thr Ser 
705 710 715 720 

Asn Tyr lie Gin Met Met He Ala Asn Asn He Asn Met Asn Met Ser 
725 730 735 

Cys Leu Pro Leu Val Gly Ser Asp He Gly Gly Phe Thr Ser Tyr Asp 
740 745 750 

Asn 61 u Asn Gin Arg Thr Pro Cys Thr Gly Asp Leu Met Val Arg Tyr 
755 760 765 

Val Gin Ala Gly Cys Leu Leu Pro Trp Phe Arg Asn His Tyr Asp Arg 
770 775 780 

Trp He Glu Ser Lys Asp His Gly Lys Asp Tyr Gin Glu Leu Tyr Met 
785 790 795 800 

Tyr Pro Asn Glu Met Asp Thr Leu Arg Lys Phe Val Glu Phe Arg Tyr 
805 810 815 

Arg Trp Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn Ala Ala Phe 
820 825 830 

Gly Lys Pro He He Lys Ala Ala Ser Met Tyr Asn Asn Asp Ser Asn 
835 840 845 

Val Arg Arg Ala Gin Asn Asp His Phe Leu Leu Gly Gly His Asp Gly 
850 855 860 

Tyr Arg He Leu Cys Ala Pro Val Val Trp Glu Asn Ser Thr Glu Arg 
865 870 875 880 

Glu Leu Tyr Leu Pro Val Leu Thr Gin Trp Tyr Lys Phe Gly Pro Asp 
885 890 895 

Phe Asp Thr Lys Pro Leu Glu Gly Ala Met Asn Gly Gly Asp Arg lie 
900 905 910 

Tyr Asn Tyr Pro Val Pro Gin Ser Glu Ser Pro He Phe Val Arg Glu 
915 920 925 

Gly Ala He Leu Pro Thr Arg Tyr Thr Leu Asn Gly Glu Asn Lys Ser 
930 935 940 

Leu Asn Thr Tyr Thr Asp Glu Asp Pro Leu Val Phe Glu Val Phe Pro 
945 950 955 960 

Leu 61y Asn Asn Arg Ala Asp Gly Met Cys Tyr Leu Asp Asp Gly Gly 
965 970 975 

Val Thr Thr Asn Ala Glu Asp Asn Gly Lys Phe Ser Val Val Lys Val 
980 985 990 

Ala Ala Glu Gin Asp Gly Gly Thr Glu Thr He Thr Phe Thr Asn Asp 
995 1000 1005 
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Cys Tyr Glu Tyr Val Phe Gly Gly Pro Phe Tyr Val Arg Val Arg Gly 
1010 1015 1020 

Ala Gin Ser Pro Ser Asn He His Val Ser Ser Gly Ala Gly Ser Gin 
1025 1030 1035 1040 

Asp Met Lys Val Ser Ser Ala Thr Ser Arg Ala Ala Leu Phe Asn Asp 
1045 1050 1055 

Gly Glu Asn Gly Asp Phe Trp Val Asp Gin Glu Thr Asp Ser Leu Trp 
1060 1065 1070 

Leu Lys Leu Pro Asn Val Val Leu Pro Asp Ala Val lie Thr He Thr 
1075 1080 1085 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(i1) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Tyr Pro Thr Leu Thr Phe Val Ala Pro Ser Ala Leu Gly Ala Arg 
15 10 15 

Thr Phe Thr Cys Val Gly He Phe Arg Ser His lie Leu lie His Ser 
20 25 30 

Val Val Pro Ala Val Arg Leu Ala Val Arg Lys Ser Asn Arg Leu Asn 
35 40 45 

Val Ser Met Ser Ala Leu Phe Asp Lys Pro Thr Ala Val Thr Gly Gly 
50 55 60 

Lys Asp Asn Pro Asp Asn lie Asn Tyr Thr Thr Tyr Asp Tyr Val Pro 
65 70 75 80 

Val Trp Arg Phe Asp Pro Leu Ser Asn Thr Asn Trp Phe Ala Ala Gly 
85 90 95 

Ser Ser Thr Pro Gly Asp He Asp Asp Trp Thr Ala Thr Met Asn Val 
100 105 no 

Asn Phe Asp Arg He Asp Asn Pro Ser Phe Thr Leu Glu Lys Pro Val 
115 120 125 

Gin Val Gin Val Thr Ser Tyr Lys Asn Asn Cys Phe Arg Val Arg Phe 
130 135 140 
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Asn Pro Asp Gly Pro He Arg Asp Val Asp Arg Gly Pro lie Leu Gin 
145 150 155 160 

Gin Gin Leu Asn Trp lie Arg Lys Gin Glu Gin Ser Lys Gly Phe Asp 
165 170 175 

Pro Lys Met Gly Phe Thr Lys Glu Gly Phe Leu Lys Phe Glu Thr Lys 
180 185 190 

Asp Leu Asn Val lie lie Tyr Gly Asn Phe Lys Thr Arg Val Thr Arg 
195 200 205 

Lys Arg Asp Gly Lys Gly lie Met Glu Asn Asn Glu Val Pro Ala Gly 
210 215 220 

Ser Leu Gly Asn Lys Cys Arg Gly Leu Met Phe Val Asp Arg Leu Tyr 
225 230 235 240 

Gly Thr Ala lie Ala Ser Val Asn Glu Asn Tyr Arg Asn Asp Pro Asp 
245 250 255 

Arg Lys Glu Gly Phe Tyr Gly Ala Gly Glu Val Asn Cys Glu Phe Trp 
260 265 270 

Asp Ser Glu Gin Asn Arg Asn Lys Tyr lie Leu Glu Arg Thr Gly lie 
275 280 285 

Ala Met Thr Asn Tyr Asn Tyr Asp Asn Tyr Asn Tyr Asn Gin Ser Asp 
290 295 300 

Leu lie Ala Pro Gly Tyr Pro Ser Asp Pro Asn Phe Tyr He Pro Met 
305 310 315 320 

Tyr Phe Ala Ala Pro Trp Val Val Val Lys Gly Cys Ser Gly Asn Ser 
325 330 335 

Asp Glu Gin Tyr Ser Tyr Gly Trp Phe Met Asp Asn Val Ser Gin Thr 
340 345 350 

Tyr Met Asn Thr Gly Gly Thr Ser Trp Asn Cys Gly Glu Glu Asn Leu 
355 360 365 

Ala Tyr Met Gly Ala Gin Cys Gly Pro Phe Asp Gin His Phe Val Tyr 
370 375 380 

Gly Asp Gly Asp Gly Leu Glu Asp Val Val Gin Ala Phe Ser Leu Leu 
385 390 395 400 

Gin Gly Lys Glu Phe Glu Asn Gin Val Leu Asn Lys Arg Ala Val Met 
405 410 415 

Pro Pro Lys Tyr Val Phe Gly Tyr Phe Gin Gly Val Phe Gly He Ala 
420 425 430 

Ser Leu Leu Arg Glu Gin Arg Pro Glu Gly Gly Asn Asn He Ser Val 
435 440 445 
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Gin 61u He Val Glu Gly Jyr Gin Ser Asn Asn Phe Pro Leu Glu Gly 
450 455 460 

Leu Ala Val. Asp Val Asp Met Gin Gin Asp Leu Arg Val Phe Thr Thr 
465 470 475 480 

Lys He Glu Phe Trp Thr Ala Asn Lys Val Gly Thr Gly Gly Asp Ser 
485 490 495 

Asn Asn Lys Ser Val Phe Glu Trp Ala His Asp Lys Gly Leu Val Cys 
500 505 510 

Gin Thr Asn Val Thr Cys Phe Leu Arg Asn Asp Asn Gly Gly Ala Asp 
515 520 525 

Tyr Glu Val Asn Gin Thr Leu Arg Glu Lys Gly Leu Tyr Thr Lys Asn 
530 535 540 

Asp Ser Leu Thr Asn Thr Asn Phe Gly Thr Thr Asn Asp Gly Pro Ser 
545 550 555 560 

Asp Ala Tyr lie Gly His Leu Asp Tyr Gly Gly Gly Gly Asn Cys Asp 
565 570 575 

Ala Leu Phe Pro Asp Trp Gly Arg Pro Gly Val Ala Glu Trp Trp Gly 
580 585 590 

Asp Asn Tyr Ser Lys Leu Phe Lys He Gly Leu Asp Phe Val Trp Gin 
595 600 605 

Asp Met Thr Val Pro Ala Met Met Pro His Lys Val Gly Asp Ala Val 
610 615 620 

Asp Thr Arg Ser Pro Tyr Gly Trp Pro Asn Glu Asn Asp Pro Ser Asn 
625 630 635 640 

Gly Arg Tyr Asn Trp Lys Ser Tyr His Pro Gin Val Leu Val Thr Asp 
645 650 655 

Met Arg Tyr Glu Asn His Gly Arg Glu Pro Met Phe Thr Gin Arg Asn 
660 665 670 

Met His Ala Tyr Thr Leu Cys Glu Ser Thr Arg Lys Glu Gly He Val 
675 680 685 

Ala Asn Ala Asp Thr Leu Thr Lys Phe Arg Arg Ser Tyr He He Ser 
690 695 700 

Arg Gly Gly Tyr He Gly Asn Gin His Phe Gly Gly Met Trp Val Gly 
705 710 715 720 

Asp Asn Ser Ser Ser Gin Arg Tyr Leu Gin Met Met lie Ala Asn He 
725 730 735 

Val Asn Met Asn Met Ser Cys Leu Pro Leu Val Gly Ser Asp He G1y 
740 745 750 
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Gly Phe Thr Ser lyr Asp Gly Arg Asn Val Cys Pro Gly Asp Leu Met 
755 760 . 765 

Val Arg Phe Val Gin Ala Gly Cys Leu Leu Pro Trp Phe Arg Asn His 
770 775 780 

Tyr Gly Arg Leu Val 61u Gly Lys Gin 61u Gly Lys Tyr Tyr Gin Glu 
785 790 795 800 

Leu Tyr Met Tyr Lys Asp Glu Met Ala Thr Leu Arg Lys Phe He Glu 
805 810 815 

Phe Arg Tyr Arg Trp Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn 
820 825 830 

Ala Ala Phe Gly Lys Pro He He Lys Ala Ala Ser Met Tyr Asp Asn 
835 840 845 

Asp Arg Asn Val Arg Gly Ala Gin Asp Asp His Phe Leu Leu Gly Gly 
850 855 860 

His Asp Gly Tyr Arg He Leu Cys Ala Pro Val Val Trp Glu Asn Thr 
865 870 875 880 

Thr Ser Arg Asp Leu Tyr Leu Pro Val Leu Thr Lys Trp Tyr Lys Phe 
885 890 895 

Gly Pro Asp Tyr Asp Thr Lys Arg Leu Asp Ser Ala Leu Asp Gly Gly 
900 905 910 

Gin Met He Lys Asn Tyr Ser Val Pro Gin Ser Asp Ser Pro He Phe 
915 920 925 

Val Arg Glu Gly Ala He Leu Pro Thr Arg Tyr Thr Leu Asp Gly Ser 
930 935 940 

Asn Lys Ser Met Asn Thr Tyr Thr Asp Lys Asp Pro Leu Val Phe Glu 
945 950 955 960 

Val Phe Pro Leu Gly Asn Asn Arg Ala Asp Gly Met Cys Tyr Leu Asp 
965 970 975 

Asp Gly Gly He Thr Thr Asp Ala Glu Asp His Gly Lys Phe Ser Val 
980 985 990 

He Asn Val Glu Ala Leu Arg Lys Gly Val Thr Thr Thr He Lys Phe 
995 1000 1005 

Ala Tyr Asp Thr Tyr Gin Tyr Val Phe Asp Gly Pro Phe Tyr Val Arg 
1010 1015 1020 

He Arg Asn Leu Thr Thr Ala Ser Lys He Asn Val Ser Ser Gly Ala 
1025 1030 1035 1040 

Gly Glu Glu Asp Met Thr Pro Thr Ser Ala Asn Ser Arg Ala Ala leu 
1045 1050 1055 
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Phe Ser Asp Gly Gly Val Gly Glu Tyr Trp Ala Asp Asn Asp Thr Ser 
1060 1065 1070 

Ser Leu Trp Met Lys Leu Pro Asn Leu Val Leu Gin Asp Ala Val He 
1075 1080 1085 

Thr He Thr 
1090 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGTTTTCAA 


CrCTTGffiTT 






GAGrrARTAC 


rTTrRTACRG 




GCGGAGGTCA 


GGTCAAATGT 


TCGTATCCAT 


TCCGCTTTTC 


CAGCTGTGCA 


CACAGCTACT 


120 


CGCAAAACCA 


ATCGCCTCAA 


TGTATCCATG 


ACCGCATTGT 


CCGACAAACA 


AACGGCTACT 


180 


GCGGGTAGTA 


CAGACAATCC 


GGACG6TATC 


GACTACAAGA 


CCTACGATTA 


CGTCGGAGTA 


240 


TGGGGTTTCA 


GCCCCCTCTC 


CAACACGAAC 


TG6TTTGCTG 


CCGGCTCTTC 


TACCCCGGGT 


300 


GGCATCACTG 


ATTGGACGGC 


TACAATGAAT 


GTCAACTTCG 


ACCGTATCGA 


CAATCCGTCC 


360 


ATCACT6TCC 


AGCATCCCGT 


TCAGGHCAG 


GTCAC6TCAT 


ACAACAACAA 


CAGCTACAGG 


420 


CnCGCTTCA 


ACCCTGATGG 


CCCTATTCGT 


GATGTGACTC 


GTGGGCCTAT 


CCTCAAGCA6 


480 


CAACTAGATT 


GGATTCGAAC 


GCAGGAGCTG 


TCAGAGGGAT 


GTGATCCCGG 


AATGACTTTC 


540 


ACATCAGAAG 


GTTTCTTGAC 


TTTT6A6ACC 


AAGGATCTAA 


GCGTCATCAT 


CTAC6GAAAT 


600 


TTCAAGACCA 


GAGHACGAG 


AAAGTCTGAC 


GGCAAGGTCA 


TCATGGAAAA 


TGATGAAGTT 


660 


GGAACTGCAT 


CGTCCGGGAA 


CAAGTGCCGG 


GGATTGATGT 


TCGTTGATAG 


ATTATAC6GT 


720 


AACGCTATCG 


CTTCCGTCAA 


CAAGAACTTC 


CGCAACGACG 


C6GTCAAGCA 


G6AGGGATTC 


780 


TATGGTGCAG 


GTGAAGTCAA 


CTGTAAGTAC 


CAGGACACCT 


ACATCTTAGA 


ACGCACTGGA 


840 


ATCGCCATGA 


CAAATTACAA 


CTAC6ATAAC 


TTGAACTATA 


ACCAGTG6GA 


CCTTAGACCT 


900 


CCGCATCATG 


AT6GTGCCCT 


CAACCCA6AC 


TATTATAnC 


CAATGTACTA 


CGCAGCACCT 


960 


TGGHGATCG 


TTAATGGATG 


CGCCG6TACT 


TCGGAGCAGT 


ACTCGTATGG 


ATGGTTCATG 


1020 
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GACAATGTCT 


CTCAATCTTA 


CATGAATACT 


GGAGATACTA 


CCTGGAATTC 


TGGACAAGAG 


1080 


GACCTGGCAT 


ACATGGGCGC 


GCAGTATGGA 


Cf.ATTTGACC 


AACATTTTGT 


TTACGGTGCT 


1140 


GGGGGTGGGA 


TGGAATGTGT 


GGTCACAGCG 


TTCTCTCTTC 


TACAAGGCAA 


GGAGTTCGAG 


1200 


AACCAAGTTC 


TCAACAAACG 


TTCAGTAATG 


CCTCCGAAA1 


ACGTCTTT6G 


TTTCTTCCAG 


1260 


GGIGTHTCG 


6GACTTCTTC 


CTTGTTGAGA 


GC6CATAT6C 


CAGCAGGTGA 


GAACAACATC 


1320 


TCAGTCGAAG 


AAAHGTAGA 


AG6TTATCAA 


AACAACAATT 


TCCCTTTCGA 


GGGGCTCGCT 


1380 


GTGGACGTGG 


ATATGCAAGA 


CAACTTGCGG 


GTGTTCACCA 


CGAAGGGCGA 


ATTTTGGACC 


1440 


GCAAACAGGG 


TGG6TACTGG 


CG66GATCCA 


AACAACCGAT 


CGGTirrTGA 


ATGGGCACAT 


1500 


GACAAAGGCC 


TIGTHGTCA 


GACAAATATA 


ACTTGcrrcc 


T6AGGAATGA 


TAACGAGGGG 


1560 


CAAGACTACG 


AGGTCAATCA 


GACGHAAGG 


GAGAGGCAGT 


TGTACACGAA 


GAACGACTCC 


1620 


CTGACGGGTA 


CGGAHTIGG 


AATGACCGAC 


GACGGCCCCA 


GCGATGCGTA 


CATCGGTCAT 


1680 


CTGGACTATG 


GGGGTGGAGT 


AGAATGTGAT 


GCACTTTTCC 


CAGACTGGGG 


ACGGCCTGAC 


1740 


GTGGCCGAAT 


GGTGGGGAAA 


TAACTATAA6 


AAACT6TTCA 


6CATTGGTCT 


CGACTTCGTC 


1800 


TGGCAAGACA 


TGACTGTTCC 


AGCAATGATG 


CCGCACAAAA 


TTGGCGATGA 


CATCAATGTG 


1860 


AAACCGGATG 


GGAATTGGCC 


GAATGCGGAC 


GATCC6TCCA 


ATGGACAATA 


CAACTGGAAG 


1920 


ACGTACCATC 


CCCAAGTGCT 


TGTAACTGAT 


AT6C6TTATG 


AGAATCATGG 


TCGGGAACCG 


1980 


ATGGTCACTC 


AACGCAACAT 


TCATGCGTAT 


ACACT6TGCG 


AGTCTACTAG 


GAAGGAAGGG 


2040 


ATCGTGGAAA 


ACGCAGACAC 


TCTAACGAAG 


TTCCGCCGTA 


GCTACATTAT 


CAGTCGTGGT 


2100 


GGTTACATTG 


GTAACCAGCA 


TTTCGGGGGT 


ATGTGGGTGG 


6AGACAACTC 


TACTACATCA 


2160 


AACTACATCC 


AAATGATGAT 


TGCCAACAAT 


ATTAACATGA 


ATATGTCTT6 


CTTGCCTCTC 


2220 


GTC66CTCCG 


ACAHGGAGG 


ATTCACCTCA 


TACGACAATG 


AGAATCAGCG 


AACGCC6TGT 


2280 


ACCGGGGACT 


TGAT6GTGAG 


GTATGTGCAG 


GCGGGCTGCC 


TGTTGCCGTG 


GTTCAGGAAC 


2340 


CACTATGATA 


GGTGGATCGA 


GTCCAAGGAC 


CACGGAAAGG 


ACTACCAGGA 


GCT6TACATG 


2400 


TATCC6AATG 


AAATGGATAC 


GTTGAGGAAG 


TTC6TTGAAT 


TCCGTTA7CG 


CTGGCAGGAA 


2460 


GTGTT6TACA 


CGGCCATGTA 


CCAGAATGCG 


GCTTTCGGAA 


AGCCGATTAT 


CAAGGCTGCT 


2520 


TCGATGTACA 


A7AACGACTC 


AAACGTTCGC 


AGGGCGCAGA 


ACGA7CATTT 


CCTTCTTGGT 


2580 


GGACATGATG 


GATATCGCAT 


TCTGTGCGCG 


CCTGTTGTGT 


GGGAGAATTC 


GACCGAACGC 


2640 


GAATTGTACT 


TGCCCGTGCT 


GACCCAATGG 


TACAAATTC6 


GTCCCGACTT 


TGACACCAAG 


2700 
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CCTCTG6AAG GAGCGATGAA CGGAGGGGAC CGAATTTACA ACTACCCTGT ACCGCAAAGT 2760 

GAATCACCAA TCT7CGTGAG AGAAGGTGCG ATTCTCCCTA CCCGCTACAC GTTGAACGGT 2820 

GAAAACAAAT CATTGAACAC GTACACGGAC 6AA6ATCCGT TGGT6TTTGA AGTATTCCCC 2880 

CTCGGAAACA ACCGTGCCGA CGGTATGTGT TATCHGATG ATGGCGGTGT GACCACCAAT 2940 

GCTGAAGACA ATGGCAAGTT CTCTGTCGTC AAGGTGGCAG C6GAGCAGGA TGGTGGTACG 3000 

GAGACGA7AA CGTTTACGAA TGATTGCTAT GAGTACGTTT TCGGTGGACC GTTCTACGT7 3060 

CGAGT6CGCG GCGCTCA6TC GCC6TCGAAC ATCCACGTGT CTTCT6GAGC GGGTTCTCAG 3120 

GACAT6AAGG TGAGCTCT6C CACHCCAGG GCTGCGCTGT TCAATGACGG GGAGAACGGT 3180 

GA7TTCTGGG HGACCAGGA GACAGAHCT C7G7GGC7GA AG77GCCCAA CGHGTrCTC 3240 

CCG6ACGCT6 TGATCACAAT TACCTAA 3267 
(2) INFORMATION FOR SEQ 10 NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



ATGTATCCAA 


CCCTCACCTT 


C67G6CGCCT 


AG7GCGC7AG GGGCCAGAAC 7TTCACGTGT 


60 


GTGGGCATTT 


TTAGGTCACA 


CAHCTTAH 


CATTCGGTTG TTCCAGCGGT GCGTCTAGCT 


120 


GTGCGCAAAA 


GCAACCGCCT 


CAATGTATCC 


ATGTCCGCTT TGTTC6ACAA ACCGACTGCT 


180 


GHACTGGAG 


GGAAGGACAA 


CCCGGACAAT 


ATCAATTACA CCACTTATGA CTACGTCCCT 


240 


GTGTGGCGCT 


TCGACCCCCT 


CAGCAATACG 


AACTGGTTTG C7GCCGGA7C T7CCACTCCC 


300 


GGCGATATT6 


ACGACTG6AC 


GGC6ACAA7G 


AATG7GAAC7 TCGACCG7AT CGACAATCCA 


360 


TCCTTCACTC 


TCGAGAAACC 


GGTTCAGGTT 


CAG6TCAC6T CATACAA6AA CAATT6TTTC 


420 


AGGGTTCGCT 


TCAACCCTGA 


TGGTCCTAH 


CGCGATGTGG ATCGTGGGCC TATCCTCCAG 


480 


CAGCAACTAA 


ATTGGATCCG 


GAAGCAGGAG 


CAGTCGAAGG GGTTTGA7CC TAAGATGGGC 


540 


nCACAAAAG 


AAGGTTTCTT 


GAAATTTGAG 


ACCAAGGATC TGAACGHAT CATATATGGC 


600 


AATTTTAAGA 


CTAGAGTTAC 


GAGGAAGAGG 


GA7GGAAAA6 GGA7CATGGA GAA7AATGAA 


660 



SUBSTITUTE SHEET (RULE 26) 



wo 95/10616 



PCT/EP94/03397 



94 



GTGCCGGCAG 


GATCGTTAGG 


GAACAAGTGC 


CGGGGATTGA 


TGTTTGTCGA 


CAGGTTGTAC 


720 


GGCACTGCCA 


TCGCTTCCGT 


7AATGAAAAT 


TACCGCAACG 


AlCCCGACAG 


GAAAGAGGGG 


780 


TTCTATG6TG 


CAG6AGAAGT 


AAACTGCGAG 


TTTTGGGACT 


CCGAACAAAA 


CA6GAACAAG 


840 


TACATCHAG 


AAC6AACTGG 


AATCGCCATG 


ACAAATTACA 


ATTATGACAA 


CTATAACTAC 


900 


AACCAGTCAG 


ATCTTAHGC 


TCCAGGATAT 


CCTTCCGACC 


CGAACTTCTA 


CATTCCCATG 


960 


TATTTTGCAG 


CACCTT66GT 


AGTTGTTAAG 


GGATGCAGTG 


GCAACAGCGA 


TGAACAGTAC 


1020 


TCGTAC6GAT 


GGHTAIGGA 


TAATGTCTCC 


CAAACHACA 


TGAATACTGG 


TGG7ACTTCC 


1080 


T6GAACTGTG 


GAGAGGAGAA 


CTT6GCATAC 


ATGGGAGCAC 


AGT6CGGTCC 


ATTTGACCAA 


1140 


CATTTTGTGT 


ATGGTGATGG 


AGATGGTCTT 


GAGGATGHG 


TCCAAGCGH 


CTCTCTTCTG 


1200 


CAAGGCAAAG 


AGTTTGAGAA 


CCAAGTTCTG 


AACAAACGT6 


CCGTAATGCC 


TCCGAAATAT 


1260 


6TGTTTGGTT 


ACmCAGGG 


AGTCTTTGGG 


ATTGCTTCCT 


TGHGAGAGA 


GCAAAGACCA 


1320 


GAGGGTGGTA 


ATAACATCTC 


TGTTCAA6AG 


ATTGTCGAAG 


GHACCAAAG 


CAATAACTTC 


1380 


CCTTTAGAGG 


GGHAGCCGT 


AGAT6TGGAT 


ATGCAACAAG 


ATTTGCGC6T 


GTTCACCAC6 


1440 


AAGATTGAAT 


nTGGACGGC 


AAATAAGGTA 


6GCACCGGGG 


GAGACTCGAA 


TAACAAGTCG 


1500 


GTGTTTGAAT 


6G6CACATGA 


CAAAGGCCTT 


GTATGTCAGA 


CGAATGTTAC 


TTGCTTCTTG 


1560 


AGAAACGACA 


ACGGCGGGGC 


A6ATTACGAA 


GTCAATCAGA 


CATTGAGGGA 


GAAG6GTTTG 


1620 


TACAC6AAGA 


ATGACTCACT 


GAC6AACACT 


AACTTC6GAA 


CTACCAACGA 


CGGGCCGAGC 


1680 


GATGCGTACA 


TTGGACATCT 


GGACTATGGT 


GGCGGAGGGA 


ATTGTGATGC 


ACTTTTCCCA 


1740 


GACTG6GGTC 


GACCGGGTGT 


GGCTGAATGG 


TGG66TGATA 


ACTACAGCAA 


GCTCTTCAAA 


1800 


ATTGGTCTGG 


ATTTCGTCTG 


GCAAGACAT6 


ACAGTTCCAG 


CTATGATGCC 


ACACAAAGTT 


1860 


GGCGACGCAG 


TCGATACGAG 


ATCACCTTAC 


GGCTGGCCGA 


ATGAGAATGA 


TCCTTCGAAC 


1920 


GGACGATACA 


ATTGGAAATC 


TTACCATCCA 


CAAGTTCTCG 


TAACTGATAT 


GCGATAIGAG 


1980 


AATCATG6AA 


GG6AACCGAT 


GTTCACTCAA 


C6CAATATGC 


ATGCGTACAC 


ACTCTGTGAA 


2040 


TCTACGAGGA 


AGGAAGGGAT 


TGTTGCAAAT 


GCAGACACTC 


TAACGAAGTT 


CCGCCGCAGT 


2100 


TATATTATCA 


GTCGTGGAGG 


TTACAnGGC 


AACCAGCATT 


TTGGAGGAAT 


GTGGGTTGGA 


2160 


6ACAACTCTT 


CCTCCCAAAG 


ATACCTCCAA 


ATGATGATCG 


CGAACATCGT 


CAACATGAAC 


2220 


ATGTCTTGCC 


TTCCACTAGT 


TGGGTCCGAC 


ATTGGAGGTT 


HACTTCGTA 


TGATGGACGA 


2280 


AACGTGTGTC 


CCGGGGATCT 


AATG6TAAGA 


TTCGTGCAGG 


CG6GTTGCTT 


ACTACCGTGG 


2340 
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ACTATGGTAG 


95 

GTTGGTCGAG GGCAAGCAAG AGGGAAAATA CTATCAAGAA 


2400 


CTGTACATGT 


ACAAGGACGA 


GAIGGCTACA TTGAGAAAAT TCATTGAATT CCGTTACCGC 


2450 


TGGCAGGAGG 


TGTTGTACAC 


TGCTATGTAC CAGAATGCGG CTTTCGGGAA ACCGATTATC 


2520 


AAGGCAGCTT 


CCATGTACGA 


CAACGACAGA AACGTTCGCG GCGCACAGGA TGACCACTTC 


2580 


CTTCTCGGCG 


GACACGATGG 


ATATCGTATT TTGTGTGCAC CTGTTGTGTG GGAGAATACA 


2640 


ACCAGTCGCG 


ATCTGTACTT 


GCCTGT6CT6 ACCAAATG6T ACAAATTCG6 CCCT6ACTAT 


2700 


GACACCAAGC 


GCCTGGATTC 


r»TT^^ AT A 0 f^Or* A ^ A "IT/^ A T*T A A O A A ATTr*T#»T^ 

TGCGTTGGAT GGAGGGCAGA TGATTAAGAA CTATTCTGTG 


2760 


CCACAAAGCG 


ACTCTCCGAT 


ATTTGTGAGG GAAGGAGCTA TTCTCCCTAC CCGCTACACG 


2820 


TT6GACGGTT 


AAA AAA A A 

C6AACAAGTC 


AATGAACACG TACACAGACA AAGACCC6TT GGTGTTT6AG 


2880 


6TATTCCCTC 


TTGGAAACAA 


CCGTGCCGAC GGTATGTGTT ATCHGATGA TGGCGGTATT 


2940 


ACTACAGATG 


CTGAGGACCA 


TG6CAAATTC TCTGTTATCA ATGTCGAAGC CTTACGGAAA 


3000 


GGTGTTACGA 


CGACGATCAA 


GTTTGCGTAT GACACHATC AATACGTATT TGATGGTCCA 


3060 


nCTACGTTC 


GAATCCGTAA 


TCTTACGACT GCATCAAAAA TTAACGTGTC TTCTGGAGCG 


3120 


GGTGAAGAGG 


ACATGACACC 


GACCTCT6CG AACTCGAGGG CAGCinOTT CAGTGATGGA 


3180 


GGTGHGGAG 


AATACTGGGC 


TGACAATGAT ACGTCTTCTC T6T6GATGAA 6TTGCCAAAC 


3240 


CT6GTTCTGC 


AA6AC6CTGT 


GATTACCAH ACGTAG 


3276 



(2) INFORMATION FOR SEQ ID NO: 5: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1066 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Gly Phe Ser Asp Pro Leu Asn Phe Cys Lys Ala Glu Asp Tyr 

15 10 15 

Tyr Ser Val Ala Leu Asp Trp Lys Gly Pro Gin Lys He He Gly Val 

20 25 30 

Asp Thr Thr Pro Pro Lys Ser Thr Lys Phe Pro Lys Asn Trp His Gly 

35 40 45 

Val Asn Leu Arg Phe Asp Asp Gly Thr Leu Gly Val Val Gin Phe He 

50 55 60 
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Arg Pro Cys Val Trp Arg Val Arg Tyr Asp Pro Gly Phe Lys Thr Ser 
65 70 75 80 

Asp Glu Tyr Gly Asp Glu Asn Thr Arg Thr He Val Gin Asp Tyr Met 
85 90 95 

Ser Thr Leu Ser Asn Lys Leu Asp Thr Tyr Arg Gly Leu Thr Irp Glu 
100 105 110 

Thr Lys Cys Glu Asp Ser Gly Asp Phe Phe Thr Phe Ser Ser Lys Val 
115 120 125 

Thr Ala Val Glu Lys Ser Glu Arg Thr Arg Asn Lys Val Gly Asp Gly 
130 135 140 

Leu Arg He His Leu Trp Lys Ser Pro Phe Arg He Gin Val Val Arg 
145 150 155 160 

Thr Leu Thr Pro Leu Lys Asp Pro Tyr Pro He Pro Asn Val Ala Ala 
165 170 175 

Ala Glu Ala Arg Val Ser Asp Lys Val Val Trp Gin Thr Ser Pro Lys 
180 185 190 

Thr Phe Arg Lys Asn Leu His Pro Gin His Lys Met Leu Lys Asp Thr 
195 200 205 

Val Leu Asp He Val Lys Pro Gly His Gly Glu Tyr Val Gly Trp Gly 
210 215 220 

Glu Met Gly Gly He Gin Phe Met Lys Glu Pro Thr Phe Met Asn Tyr 
225 230 235 240 

Phe Asn Phe Asp Asn Met Gin Tyr Gin Gin Val Tyr Ala Gin Gly Ala 
245 250 255 

Leu Asp Ser Arg Glu Pro Leu Tyr His Ser Asp Pro Phe Tyr Leu Asp 
260 265 270 

Val Asn Ser Asn Pro Glu His Lys Asn He Thr Ala Thr Phe He Asp 
275 280 285 

Asn Tyr Ser Gin He Ala He Asp Phe Gly Lys Thr Asn Ser Gly Tyr 
290 295 300 

He Lys Leu Gly Thr Arg Tyr Gly Gly He Asp Cys Tyr Gly He Ser 
305 310 315 320 

Ala Asp Thr Val Pro Glu He Val Arg Leu Tyr Thr Gly Leu Val Gly 
325 330 335 

Arg Ser Lys Leu Lys Pro Arg Tyr He Leu Gly Ala His Gin Ala Cys 
340 345 350 

Tyr Gly Tyr Gin Gin Glu Ser Asp Leu Tyr Ser Val Val Gin Gin Tyr 
355 360 365 
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Arg Asp Cys Lys Phe Pro Leu Asp Gly He His Val Asp Val Asp Val 
370 375 380 

Gin Asp Gly Phe Arg Thr Phe Thr Thr Asn Pro His Thr Phe Pro Asn 
385 390 395 400 

Pro Lys Glu Met Phe Thr Asn Leu Arg Asn Asn Gly He Lys Cys Ser 
405 410 415 

Thr Asn He Thr Pro Val He Ser He Asn Asn Arg Glu Gly Gly Tyr 
420 425 430 

Ser Thr Leu Leu Glu Gly Val Asp Lys Lys Tyr Phe He Met Asp Asp 
435 440 445 

Arg Tyr Thr Glu Gly Thr Ser Gly Asn Ala Lys Asp Val Arg Tyr Met 
450 455 460 

Tyr Tyr Gly Gly Gly Asn Lys Val Glu Val Asp Pro Asn Asp Val Asn 
465 470 475 480 

Gly Arg Pro Asp Phe Lys Asp Asn Tyr Asp Phe Pro Ala Asn Phe Asn 
485 490 495 

Ser Lys Gin Tyr Pro Tyr His Gly Gly Val Ser Tyr Gly Tyr Gly Asn 
500 505 510 

Gly Ser Ala Gly Phe Tyr Pro Asp Leu Asn Arg Lys Glu Val Arg He 
S15 520 525 

Trp Trp Gly Met Gin Tyr Lys Tyr Leu Phe Asp Met Gly Leu Glu Phe 
530 535 S40 

Val Trp Gin Asp Met Thr Thr Pro Ala He His Thr Ser Tyr Gly Asp 
545 550 555 560 

Met Lys Gly Leu Pro Thr Arg Leu Leu Val Thr Ser Asp Ser Val Thr 
565 570 575 

Asn Ala Ser Glu Lys Lys Leu Ala He Glu Thr Trp Ala Leu Tyr Ser 
580 585 590 

Tyr Asn Leu His Lys Ala Thr Trp His Gly Leu Ser Arg Leu Glu Ser 
595 600 605 

Arg Lys Asn Lys Arg Asn Phe He Leu Gly Arg Gly Ser Tyr Ala Gly 
610 615 620 

Ala Tyr Arg Phe Ala Gly Leu Trp Thr Gly Asp Asn Ala Ser Asn Trp 
625 630 635 640 

Glu Phe Trp Lys He Ser Val Ser Gin Val Leu Ser Leu Gly Leu Asn 
645 650 655 

Gly Val Cys He Ala Gly Ser Asp Thr Gly Gly Phe Glu Pro Tyr Arg 
660 665 670 
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Asp Ala Asn Gly Val 61u 61u Lys Tyr Cys Ser Pro Glu Leu Leu lie 
675 680 . 685 

Arg Irp Tyr Thr Gly Ser Phe Leu Leu Pro Trp Leu Arg Asn His Tyr 
690 695 700 

Val Lys Lys Asp Arg Lys Trp Phe Gin Glu Pro Tyr Ser Tyr Pro Lys 
705 710 715 720 

His Leu Glu Thr His Pro Glu Leu Ala Asp Gin Ala Trp Leu Tyr Lys 
725 730 735 

Ser Val Leu Glu He Cys Arg Tyr Tyr Val Glu Leu Arg Tyr Ser Leu 
740 745 750 

He Gin Leu Leu Tyr Asp Cys Net Phe Gin Asn Val Val Asp Gly Met 
755 760 765 

Pro He Thr Arg Ser Met Leu Leu Thr Asp Thr Glu Asp Thr Thr Phe 
770 775 780 

Phe Asn Glu Ser Gin Lys Phe Leu Asp Asn Gin Tyr Net Ala G1y Asp 
785 790 795 800 

Asp lie Leu Val Ala Pro lie Leu His Ser Arg Lys Glu lie Pro Gly 
805 810 815 

Glu Asn Arg Asp Val Tyr Leu Pro Leu Tyr His Thr Trp Tyr Pro Ser 
820 825 830 

Asn Leu Arg Pro Trp Asp Asp Gin Gly Val Ala Leu Gly Asn Pro Val 
835 840 845 

Glu Gly Gly Ser Val He Asn Tyr Thr Ala Arg He Val Ala Pro Glu 
850 855 860 

Asp Tyr Asn Leu Phe His Ser Val Val Pro Val Tyr Val Arg Glu Gly 
865 870 875 880 

Ala He He Pro Gin He Glu Val Arg Gin Trp Thr Gly Gin Gly Gly 
885 890 895 

Ala Asn Arg He Lys Phe Asn He Tyr Pro Gly Lys Asp Lys Glu Tyr 
900 905 910 

Cys Thr Tyr Leu Asp Asp Gly Val Ser Arg Asp Ser Ala Pro Glu Asp 
915 920 925 

Leu Pro Gin Tyr Lys Glu Thr His Glu Gin Ser Lys Val Glu Gly Ala 
930 935 940 

Glu He Ala Lys Gin He Gly Lys Lys Thr Gly Tyr Asn He Ser Gly 
945 950 955 960 

Thr Asp Pro Glu Ala Lys Gly Tyr His Arg Lys Val Ala Val Thr Gin 
965 970 975 
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Ihr Ser Lys Asp Lys Thr Arg Thr Val Thr lie Glu Pro Lys His Asn 
980 985 990 

Gly Tyr Asp. Pro Ser Lys Glu Val Gly Asp Tyr Tyr Thr lie He Leu 
995 1000 1005 

Trp Tyr Ala Pro Gly Phe Asp Gly Ser He Val Asp Val Ser Lys Thr 
1010 1015 1020 

Thr Val Asn Val Glu Gly Gly Val Glu His Gin Val Tyr Lys Asn Ser 
1025 1030 1035 1040 

Asp Leu His Thr Val Val lie Asp Val Lys Glu Val He Gly Thr Thr 
1045 1050 1055 

Lys Ser Val Lys He Thr Cys Thr Ala Ala 
1060 1065 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Gly Leu Ser Asp Pro Leu Asn Phe Cys Lys Ala Glu Asp Tyr 
15 10 15 

Tyr Ala Ala Ala Lys Gly Trp Ser Gly Pro Gin Lys He He Arg Tyr 
20 25 30 

Asp Gin Thr Pro Pro Gin Gly Thr Lys Asp Pro Lys Ser Trp His Ala 
35 40 45 

Val Asn Leu Pro Phe Asp Asp Gly Thr Met Cys Val Val Gin Phe Val 
50 55 60 

Arg Pro Cys Val Trp Arg Val Arg Tyr Asp Pro Ser Val Lys Thr Ser 
65 70 75 80 

Asp Glu Tyr Gly Asp Glu Asn Thr Arg Thr He Val Gin Asp Tyr Met 
85 90 95 

Thr Thr Leu Val Gly Asn Leu Asp He Phe Arg Gly Leu Thr Trp Val 
100 105 110 

Ser Thr Leu Glu Asp Ser Gly Glu Tyr Tyr Thr Phe Lys Ser Glu Val 
115 120 125 
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Thr Ala Val Asp Glu 7hr Glu Arg Thr Arg Asn Lys Val Gly Asp Gly 
130 135 140 

Leu Lys lie Tyr Leu Trp Lys Asn Pro Phe Arg He Gin Val Val Arg 
145 150 155 160 

Leu Leu Thr Pro Leu Val Asp Pro Phe Pro He Pro Asn Val Ala Asn 
165 170 175 

Ala Thr Ala Arg Val Ala Asp Lys Val Val Trp Gin Thr Ser Pro Lys 
180 185 190 

Thr Phe Arg Lys Asn Leu His Pro Gin His Lys Met Leu Lys Asp Thr 
195 200 205 

Val Leu Asp He He Lys Pro Gly His Gly Glu Tyr Val Gly Trp Gly 
210 215 220 

Glu Met Gly Gly He Glu Phe Met Lys Glu Pro Thr Phe Met Asn Tyr 
225 230 235 240 

Phe Asn Phe Asp Asn Met Gin Tyr Gin Gin Val Tyr Ala Gin Gly Ala 
245 250 255 

Leu Asp Ser Arg Glu Pro Leu Tyr His Ser Asp Pro Phe Tyr Leu Asp 
260 265 270 

Val Asn Ser Asn Pro Glu His Lys Asn He Thr Ala Thr Phe He Asp 
275 280 285 

Asn Tyr Ser Gin He Ala He Asp Phe Gly Lys Thr Asn Ser Gly Tyr 
290 295 300 

He Lys Leu Gly Thr Arg Tyr Gly Gly He Asp Cys Tyr Gly He Ser 
305 310 315 320 

Ala Asp Thr Val Pro Glu He Val Arg Leu Tyr Thr Gly Leu Val Gly 
325 330 335 

Arg Ser Lys Leu Lys Pro Arg Tyr He Leu Gly Ala His Gin Ala Cys 
340 345 350 

Tyr Gly Tyr Gin Gin Glu Ser Asp Leu His Ala Val Val Gin Gin Tyr 
355 360 365 

Arg Asp Thr Lys Phe Pro Leu Asp Gly Leu His Val Asp Val Asp Phe 
370 375 380 

Gin Asp Asn Phe Arg Thr Phe Thr Thr Asn Pro He Thr Phe Pro Asn 
385 390 395 400 

Pro Lys Glu Met Phe Thr Asn Leu Arg Asn Asn Gly He Lys Cys Ser 
405 410 415 

Thr Asn He Thr Pro Val He Ser He Arg Asp Arg Pro Asn Gly Tyr 
420 425 ' 430 
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Ser Thr Leu Asn Glu Gly Tyr Asp Lys Lys Tyr Phe lie Met Asp Asp 
435 440 445 

Arg Tyr Thr Glu Gly Thr Ser Gly Asp Pro Gin Asn Val Arg Tyr Ser 
450 455 460 

Phe Tyr Gly Gly Gly Asn Pro Val Glu Val Asn Pro Asn Asp Val Trp 
465 470 475 480 

Ala Arg Pro Asp Phe Gly Asp Asn Tyr Asp Phe Pro Thr Asn Phe Asn 
485 490 495 

Cys Lys Asp Tyr Pro Tyr His Gly Gly Val Ser Tyr Gly Tyr Gly Asn 
500 505 510 

Gly Thr Pro Gly Tyr Tyr Pro Asp Leu Asn Arg Glu Glu Val Arg He 
515 520 525 

Trp Trp Gly Leu Gin Tyr Glu Tyr Leu Phe Asn Met Gly Leu Glu Phe 
530 535 540 

Val Trp Gin Asp Met Thr Thr Pro Ala He His Ser Ser Tyr Gly Asp 
545 550 555 560 

Met Lys Gly Leu Pro Thr Arg Leu Leu Val Thr Ala Asp Ser Val Thr 
565 570 575 

Asn Ala Ser Glu Lys Lys Leu Ala lie Glu Ser Trp Ala Leu Tyr Ser 
580 585 590 

Tyr Asn Leu His Lys Ala Thr Phe His Gly Leu Gly Arg Leu Glu Ser 
595 600 605 

Arg Lys Asn Lys Arg Asn Phe He Leu Gly Arg Gly Ser Tyr Ala Gly 
610 615 620 

Ala Tyr Arg Phe A1a Gly Leu Trp Thr Gly Asp Asn Ala Ser Thr Trp 
625 630 635 640 

Glu Phe Trp Lys He Ser Val Ser Gin Val Leu Ser Leu Gly Leu Asn 
645 650 655 

Gly Val Cys He Ala Gly Ser Asp Thr Gly Gly Phe Glu Pro Ala Arg 
660 665 670 

Thr Glu He Gly Glu Glu Lys Tyr Cys Ser Pro Glu Leu Leu He Arg 
675 680 685 

Trp Tyr Thr Gly Ser Phe Leu Leu Pro Trp Leu Arg Asn His Tyr Val 
690 695 700 

Lys Lys Asp Arg Lys Trp Phe Gin Glu Pro Tyr Ala Tyr Pro Lys His 
705 710 715 720 

Leu Glu Thr His Pro Glu Leu Ala Asp Gin Ala Trp Leu Tyr Lys Ser 
725 730 735 
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Val Leu Glu lie Cys Arg Tyr Trp Val Glu Leu Arg Tyr Ser Leu lie 
740 745 750 

Gin Leu Leu Tyr Asp Cys Met Phe Gin Asn Val Val Asp Gly Met Pro 
755 760 765 

Leu Ala Arg Ser Met Leu Leu Thr Asp Thr Glu Asp Thr Thr Phe Phe 
770 775 780 

Asn Glu Ser Gin Lys Phe Leu Asp Asn Gin Tyr Met Ala Gly Asp Asp 
785 790 795 800 

lie Leu Val Ala Pro lie Leu His Ser Arg Asn Glu Val Pro Gly Glu 
805 810 815 

Asn Arg Asp Val Tyr Leu Pro Leu Phe His Thr Trp Tyr Pro Ser Asn 
820 825 830 

Leu Arg Pro Trp Asp Asp Gin Gly Val Ala Leu Gly Asn Pro Val Glu 
835 . 840 845 

Gly Gly Ser Val He Asn Tyr Thr Ala Arg lie Val Ala Pro Glu Asp 
850 855 860 

Tyr Asn Leu Phe His Asn Val Val Pro Val Tyr He Arg Glu Gly Ala 
865 870 875 880 

He He Pro Gin He Gin Val Arg Gin Trp He Gly Glu Gly Gly Pro 
885 890 895 

Asn Pro He Lys Phe Asn He Tyr Pro Gly Lys Asp Lys Glu Tyr Val 
900 905 910 

Thr Tyr Leu Asp Asp Gly Val Ser Arg Asp Ser Ala Pro Asp Asp Leu 
915 920 925 

Pro Gin Tyr Arg Glu Ala Tyr Glu Gin Ala Lys Val Glu Gly Lys Asp 
930 935 940 

Val Gin Lys Gin Leu Ala Val He Gin Gly Asn Lys Thr Asn Asp Phe 
945 950 955 960 

Ser Ala Ser Gly He Asp Lys Glu Ala Lys Gly Tyr His Arg Lys Val 
955 970 975 

Ser He Lys Gin Glu Ser Lys Asp Lys Thr Arg Thr Val Thr He Glu 
980 985 990 

Pro Lys His Asn Gly Tyr Asp Pro Ser Lys Glu Val Gly Asn Tyr lyr 
995 1000 1005 

Thr He He Leu Trp Tyr Ala Pro Gly Phe Asp Gly Ser He Val Asp 
1010 1015 1020 
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Val Ser Gin Ala Thr Val Asn lie Glu Gly Gly Val Glu Cys Glu lie 
1025 1030 1035 1040 

Phe Lys Asn Thr Gly Leu His Thr Val Val Val Asn Val Lys 61u Val 
1045 1050 1055 

lie Gly Thr Thr Lys Ser Val Lys lie Thr Cys Thr Thr Ala 
1060 1065 1070 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ATG6CAGGAT 


TTTCTGATCC 


TCTCAACTTT 


TGCAAAGCAG 


AAGACTACTA 


CAGTGTTGCG 


60 


CTAGACTGGA 


AGG6CCCTCA 


AAAAATCATT 


GGAGTAGACA 


CTACTCCTCC 


AAAGAGCACC 


120 


AAGTTCCCCA 


AAAACTGGCA 


TGGAGTGAAC 


TTGAGATTCG 


ATGATGGGAC 


TTTAGGTGTG 


180 


GTTCAGTTCA 


TTAGGCCGTG 


CGHTGGAGG 


GTTAGATACG 


ACCCTGGTTT 


CAAGACCTCT 


240 


GACGAGTATG 


GTGATGAGAA 


TACGAGGACA 


ATTGTGCAAG 


ATTATATGAG 


TACTCTGAGT 


300 


AATAAATTGG 


ATACHATAG 


AGGTCnACG 


TGGGAAACCA 


AGT6TGAGGA 


TTCGGGAGAT 


360 


TTCTTTACCT 


TCTCATCCAA 


GGTCACCGCC 


GTTGAAAAAT 


CCGAGCGGAC 


CCGCAACAAG 


420 


GTCGGCGATG 


GCCTCAGAAT 


TCACCTATGG 


AAAAGCCCTT 


TCCGCATCCA 


AGTAGTGCGC 


480 


ACCHGACCC 


CTTTGAAGGA 


TCCTTACCCC 


ATTCCAAATG 


TAGCCGCAGC 


CGAAGCCCGT 


540 


GTGTCCGACA 


AGGTCGTnG 


GCAAAC6TCT 


CCCAAGACAT 


TCAGAAAGAA 


CCTGCATCCG 


600 


CAACACAAGA 


TGCTAAAGGA 


TACAGHCTT 


GACATTGTCA 


AACCTGGACA 


TGGCGAGTAT 


660 


GT6GGGTGGG 


GAGAGATGGG 


AGGTATCCAG 


HTATGAAGG 


AGCCAACATT 


CAT6AACTAT 


720 


TTTAACTTCG 


ACAATATGCA 


ATACCAGCAA 


GTCTATGCCC 


AAG6TGCTCT 


CGATTCTC6C 


780 


GA6CCACTGT 


ACCACTCGGA 


TCCCTTCTAT 


CTTGATGTGA 


ACTCCAACCC 


GGAGCACAAG 


840 


AATATCACGG 


CAACCTTTAT 


C6ATAACTAC 


TCTCAAATTG 


CCATCGACTT 


TGGAAAGACC 


900 


AACTCAGGCT 


ACATCAAGCT 


GGGAACCAG6 


TATGGTGGTA 


TCGATTGTTA 


CGGTATCAGT 


960 


GCGGATACGG 


TCCCGGAAAT 


TGTACGACTT 


TATACAGGTC 


TTGTTGGACG 


TTCAAAGTTG 


1020 
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AAGCCCAGAT 


ATATTCTCGG 


GGCCCATCAA 


GCCTGTTATG 


GATACCAACA 


GGAAAGTGAC 


1080 


TTGTATTCTG 


TGGTCCAGCA 


GTACCGTGAC 


TGTAAATTTC 


CACTTGACGG 


GATTCACGTC 


1140 


GA 1 b 1 tuAl la 




CI 1 CAGAAL 1 


TTC f\ r r ^ r r a 
I J LALLALLA 


ALLLALAtAL 


1 1 1 LLL 1 AAL 


1200 


LLLAAAtiAtiA 


TPTTT ATT A A 

Ibi 1 1 At lAA 


PTTrAr^A AT 

L 1 1 bAubAA 1 


AA 1 bbAA 1 LA 


ATTrrTrr Ar 
Ab ibt ILLAL 


r A AT ATf ATT 

LAA 1 A 1 LAL i 


1260 


CCTGTTATCA 


GCATTAACAA 


CA6AGAGGGT 


GGATACAGTA 


CCCTCCTTGA 


GGGAGTTGAC 


1320 


AAAAAATACT 


TTATCATGGA 


C6ACA6ATAT 


ACCGAGGGAA 


CAAGTGGGAA 


TGCGAA66AT 


1380 


GTTCGGTACA 


TGTACTACGG 


TGGTGGTAAT 


AAGGTTGAGG 


TCGATCCTAA 


TGATGTTAAT 


1440 


GGTCGGCCAG 


ACTTTAAAGA 


CAACTATGAC 


TTCCCCGCGA 


ACTTCAACAG 


CAAACAATAC 


1500 


CCCTATCATG 


6TGGTGTGAG 


CTAC6GTTAT 


GG6AACGGTA 


GTGCAG6TTT 


TTACCCGGAC 


1560 


CTCAACAGAA 


AGGAGGTTCG 


TATCT6GT6G 


6GAATGCAGT 


ACAAGTATCT 


CTTCGATATG 


1620 


GGACT6GAAT 


TT6TGTGGCA 


AGACAT6ACT 


ACCCCAGCAA 


TCCACACATC 


ATATGGAGAC 


1680 


ATGAAAGGGT 


TGCCCACCCG 


TCTACTCGTC 


ACCTCAGACT 


CCGTCACCAA 


T6CCTCTGAG 


1740 


AAAAAGCTCG 


CAAHGAAAC 


TTGGGCTCTC 


TACTCCTACA 


ATCTCCACAA 


AGCAACTTGG 


1800 


CATGGTCTTA 


GTCGTCTCGA 


ATCTCGTAAG 


AACAAACGAA 


ACTTCATCCT 


CGGGCGTGGA 


1860 


A6TTATGCCG 


GA6CCTATCG 


TTTTGCTGGT 


CTCT6GACTG 


GGGATAATGC 


AAGTAACTGG 


1920 


GAATTCTGGA 


A6ATATC6GT 


CTCTCAAGTT 


CTTTCTCTGG 


GCCTCAATGG 


TGTGT6CATC 


1980 


GCG6GGTCTG 


ATACGG6T6G 


THTGAACCC 


TACCGTGATG 


CAAATGGGGT 


CGAGGAGAAA 


2040 


TACTGTAGCC 


CAGAGCTACT 


CATCAGGTGG 


TATACTGGTT 


CATTCCTCTT 


GCC6TGGCTC 


2100 


AG6AACCATT 


ATGTCAAAAA 


G6ACAG6AAA 


TGGTTCCA6G 


AACCATACTC 


GTACCCCAAG 


2160 


CATCTTGAAA 


CCCATCCAGA 


ACTCGCAGAC 


CAAGCATGGC 


TCTATAAATC 


CGTTTTGGAG 


2220 


ATCTGTAGG7 


ACTATGTG6A 


GCTTAGATAC 


TCCCTCATCC 


AACTACTTTA 


CGACTGCATG 


2280 


TTTCAAAACG 


TAGTCGACGG 


TAT6CCAATC 


ACCA6ATCTA 


TGCTCTTGAC 


CGATACTGAG 


2340 


GATACCACCT 


TCTTCAACGA 


GAGCCAAAAG 


TTCCTCGACA 


ACCAATATAT 


GGCTGGTGAC 


2400 


GACATTCTTG 


HGCACCCAT 


CCTCCACAGT 


C6CAAAGAAA 


TTCCAGGCGA 


AAACAGAGAT 


2460 


GTCTATCTCC 


CTCTTTACCA 


CACCTGGTAC 


CCCTCAAATT 


TGAGACCATG 


GGACGA7CAA 


2520 


GGA6TCGCTT 


TGGGGAATCC 


TGTCGAAGGT 


GGTAGTGTCA 


TCAATTATAC 


TGCTA6GATT 


2580 


GTTGCACCCG 


AGGATTATAA 


TCTCnCCAC 


AGCGTGGTAC 


CAGTCTACGT 


TAGAGAGGGT 


2640 


GCCATCATCC 


CGCAAATCGA 


AGTACGCCAA 


TGGACTGGCC 


AGGGGGGAGC 


CAACCGCATC 


2700 
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AAGTTCAACA TCTACCCTGG AAA6GATAA6 GAGTACTGTA CCTATCTTGA TGATGGTGTT 2760 

AGCCGTGATA GTGCGCCGGA AGACCTCCCA CAGTACAAAG AGACCCACGA ACAGTCGAAG 2820 

GTTGAAGGCG CGGAAATCGC AAAGCAGATT GGAAAGAAGA CGGGTTACAA CATCTCAGGA 2880 

ACCGACCCAG AAGCAAAGGG TTATCACCGC AAAGTTGCTG TCACACAAAC GTCAAAAGAC 2940 

AA6ACGC6TA CTGTCACTAT T6A6CCAAAA CACAATG6AT ACGACCCTTC CAAAGA6GTG 3000 

GGTGATTATT ATACCATCAT TCITTGGTAC GCACCAGGTT TCGATGGCAG CATCGTCGAT 3060 

GTGAGCAAGA CGACTGTGAA TGTTGAG6GT GGGGTGGAGC ACCAAGHTA TAAGAACTCC 3120 

GATTTACATA CGGTTGTTAT CGACGTGAAG GAGGTGATCG GTACCACAAA GAGCGTCAAG 3180 

ATCACATGTA CTGCCGCTTA A 3201 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3213 base' pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATGGCAGGAT 


TATCCGACCC 


TCTCAATTTC 


TGCAAAGCAG A6GACTACTA CGCTGCTGCC 


60 


AAAGGCTGGA 


GTGGCCCTCA 


GAAGATCATT 


CGCTATGACC AGACCCCTCC TCAGGGTACA 


120 


AAAGATCCGA 


AAAGCTGGCA 


TGCGGTAAAC 


CTTCCHTCG ATGACGGGAC TATGTGTGTA 


180 


GTGCAATTCG 


TCAGACCCTG 


TGTTTGGA6G 


GTTAGATATG ACCCCAGTGT CAAGACTTCT 


240 


6ATGAGTACG 


GCGATGA6AA 


TACGAGGACT 


ATT6TACAA6 ACTACAT6AC TACTCTGGTT 


300 


GGAAACHGG 


ACATTTTCAG 


AGGTCTTAC6 


TGGGHTCTA CGTTGGAGGA TTCGG6CGAG 


360 


TACTACACCT 


TCAAGTCCGA 


AGTCACTGCC 


GT6GACGAAA CCGAACGGAC TC6AAACAAG 


420 


GTCGGCGACG 


GCCTCAAGAT 


TTACCTATGG 


AAAAATCCCT TTC6CATCCA GGTA6TGCGT 


480 


CTCTTGACCC 


CCCTGGTGGA 


CCCTTTCCCC 


ATTCCCAACG TAGCCAATGC CACAGCCCGT 


540 


GTGGCCGACA 


AGGTTGTTTG 


GCAGACGTCC 


CCGAAGACGT TCAGGAAAAA CTTGCATCCG 


600 


CAGCATAAGA 


TGTTGAAGGA 


TACAGTTCTT 


GATATTATCA AGCC6GGGCA CGGAGAGTAT 


660 


GTGGGTT6GG 


6AGAGATGGG 


A6GCATCGAG 


TTTATGAAGG AGCCAACATT CATGAATTAT 


720 
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TTCAACTTTG ACAATA7GCA ATATCAGCAG GTCTATGCAC AAGGCGCTCT TGATAGICGT 780 

GAGCCGTTGT ATCACTCTGA TCCCTTCTAT CICGACGTGA ACTCCAACCC AGAGCACAAG 84D 

AACATTACGG CAACCTTTAT CGATAACTAC TCTCAGATTG CCATCGACTT TGGGAAGACC 900 

AACTCAGGCT ACATCAAGCT G6GTACCAGG TATGGCGGTA TCGATTGTTA CGGTATCAGC 960 

GCGGATACGG TCCCGGAGAT T6TGCGACTT TATACT6GAC TTGTTGGGCG TTCGAAGTTG 1020 

AAGCCCAGGT ATATTCTCG6 AGCCCACCAA GCnSHATG GATACCAGCA GGAAAGTGAC 1080 

TTGCATGCTG TTGTTCAGCA GTACCGTGAC ACCAAGrTTC CGCTTGATGG GTTGCATGTC 1140 

GATGTCGACT TTCAG6ACAA TTTCAGAACG TTTACCACTA ACCCGATTAC GTTCCCTAAT 1200 

CCCAAAGAAA TGTTTACCAA TCTAAGGAAC AATG6AATCA AGTGTTCCAC CAACATCACC 1260 

CCTGTTATCA GTATCAGA6A TCGCCCGAAT GGGTACAGTA CCCTCAA7GA GGGATATGAT 1320 

AAAAAGTACT TCATCATGGA TGACAGATAT ACCGAG6GGA CAAGTGGG6A CCCGCAAAAT 1380 

GHCGATACT CmTTACGG C6GTGGGAAC CCGGTTGAGG TTAACCCTAA TGATGTTTGG 1440 

GCTCGGCCAG ACTTTGGAGA CAATTAT6AC TTCCCTACGA ACHCAACTG CAAAGACTAC 1500 

CCCTATCATG GT6GTGTGA6 TTACGGATAT GGGAATGGCA CTCCAGGTTA CTACCCTGAC 1560 

CTTAACAGAG AGGAGGTTCG TATCTGGTGG GGATTGCAGT ACGAGTATCT CTTCAATATG 1620 

GGACTAGAGT TTGTATGGCA AGATATGACA ACCCCAGCGA TCCATTCATC ATATGGAGAC 1680 

AT6AAA6G6T T6CCCACCCG TCTGCTCGTC ACCGCCGACT CAGHACCAA TGCCTCTGAG 1740 

AAAAAGCTCG CAATTGAAAG TTGGGCTCTT TACTCCTACA ACCTCCATAA A6CAACCTTC 1800 

CACGGTCTT6 6TC6TCTT6A GTCTCGTAA6 AACAAACGTA ACTTCATCCT CGGACGTG6T 1860 

AGTTAC6CCG 6T6CCTATCG TTTTGCTGGT CTCTGGACTG 6AGATAACGC AAGTACGTGG 1920 

GAATTCT6GA AGATHCGGT CTCCCAAGTT CTTTCTCTAG GTCTCAATGG TGTGTGTATA 1980 

GCGGGGTCTG ATACGGGTGG TTTTGAGCCC GCACGTACTG AGATTGGGGA GGAGAAATAT 2040 

TGCAGTCC6G AGCTACTCAT CAGGTGGTAT ACTGGATCAT TCCTTTTGCC ATGGCTTAGA 2100 

AACCACTACG TCAAGAAGGA CAGGAAATGG TTCCAGGAAC CATACGCGTA CCCCAAGCAT 2160 

CTTGAAACCC ATCCAGAGCT CGCAGATCAA GCATGGCTTT ACAAATCTGT TCTAGAAATT 2220 

TGCAGATACT GGGTAGAGCT AAGATATTCC CTCATCCAGC TCCTTTACGA CTGCATGTTC 2280 

CAAAACGTGG TCGATG6TAT GCCACHGCC AGATCTATGC TCTTGACCGA TACTGAGGAT 2340 

ACGACCTTCT TCAATGAGAG CCAAAAGHC CTCGATAACC AATATAT6GC T66T6ACGAC 2400 
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ATCCTTGTAG CACCCATCCT CCACAGCCGT AACGAGGTTC CGGGAGAGAA CAGAGATGTC 2460 

TATCTCCCTC TATTCCACAC CTGGTACCCC TCAAACTTGA GACCGTGG6A CGATCAGG6A 2520 

GTCGCTTTAG GGAATCCTGT CGAAGGTGGC AGCGTTATCA ACTACACTGC CAGGATTGTT 2580 

GCCCCAGAGG ATTATAATCT CTTCCACAAC GTGGTGCCGG TCTACATCAG AGAGGGTGCC 2640 

ATCATTCCGC AAATTCAGGT ACGCCA6TGG ATTG6CGAAG 6AGG6CCTAA TCCCATCAAG 2700 

TTCAATATCT ACCCTGGAAA GGACAAGGAG TATGTGACGT ACC7TGATGA T6GTGTTAGC 2760 

CGCGATAGTG CACCAGATGA CCTCCCGCAG TACCGCGAGG CCTATGAGCA A6C6AAGGTC 2820 

GAAGGCAAAG ACGTCCAGAA 6CAACTT6CG GTCATTCAA6 GGAATAAGAC TAATGACTTC 2880 

TCCGCCTCC6 GGATT6ATAA GGAGGCAAA6 GGTTATCACC GCAAAGTHC TATCAAACAG 2940 

GAGTCAAAA6 ACAAGACCCG TACTGTCACC ATTGAGCCAA AACACAACGG ATACGACCCC 3000 

TCTAAGGAAG TTGGTAAHA TTATACCATC ATTCTTTGGT ACGCACCGGG CTTTGACGGC 3060 

AGCATCGTCG ATGTGAGCCA 66C6ACCGTG AACATCGAGG GCGGG6T6GA ATGCGAAATT 3120 

nCAAGAACA CCGGCTTGCA TACGGTTGTA GTCAACGTGA AAGAGGTGAT CGGTACCACA 3180 

AAGTCC6TCA A6ATCACTTG CACTACC6CT TAG 3213 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 201 

(D) OTHER INFORMATION: /note= "X denotes a misc. amino 
acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Thr Asn Tyr Asn Tyr Asp Asn Leu Asn Tyr Asn Gin Pro Asp Leu 
15 10 15 

lie Pro Pro Gly His Asp Ser Asp Pro Asp Tyr Tyr He Pro Met Tyr 
20 25 30 

Phe Ala Ala Pro Trp Val He Ala His Gly Tyr Arg Gly Thr Ser Asp 
35 40 .. 45 
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Gin Tyr Ser Tyr Gly Trp Phe Leu Asp Asn Val Ser Gin Ser Tyr Thr 
50 55 60. 

Asn Thr Gly Asp Asp Ala Trp Ala Gly Gin Lys Asp Leu Ala Tyr Met 
65 70 75 80 

Gly Ala Gin Cys Gly Pro Phe Asp Gin His Phe Val Tyr Glu Ala Gly 
85 90 95 

Asp Gly Leu Glu Asp Val Val Thr Ala Phe Ser Tyr Leu Gin Gly Lys 
100 105 110 

Glu Tyr Glu Asn Gin Gly Leu Asn lie Arg Ser Ala Met Pro Pro Lys 
115 120 125 

Tyr Val Phe Gly Phe Phe Gin Gly Val Phe Gly Ala Thr Ser Leu Leu 
130 135 140 

Arg Asp Asn Leu Pro Ala Gly 61 u Asn Asn Val Ser Leu Glu Glu lie 
145 150 155 160 

Val Glu Gly Tyr Gin Asn Gin Asn Val Pro Phe Glu Gly Leu Ala Val 
165 170 175 

Asp Val Asp Met Gin Asp Asp Leu Arg Val Phe Thr Thr Arg Pro Ala 
180 185 190 

Phe Trp Thr Ala Asn Lys Val Gly Xaa Gly Gly Asp Pro Asn Asn Lys 
195 200 205 

Ser Val Phe Glu Trp Ala His Asp Arg Gly Leu Val Cys Gin Thr Asn 
210 215 220 

Val Thr Cys Phe Leu Lys Asn Glu Lys Asn Pro Tyr Glu Val Asn Gin 
225 230 235 240 

Ser Leu Arg Glu Lys Gin Leu Tyr Thr Lys Ser Asp Ser Leu Asp Asn 
245 250 255 

He Asp Phe Gly Thr Thr Pro Asp Gly Pro Ser Asp Ala Tyr He Gly 
260 265 270 

His Leu Asp Tyr Gly Gly Gly Val Glu Cys Asp Ala Leu Phe Pro Asp 
275 280 285 

Trp Gly Arg Pro Asp Val Ala Gin Trp Trp Gly Asp Asn Tyr Lys Lys 
290 295 300 

Leu Phe Ser He Gly Leu Asp Phe Val Trp Gin Asp Met 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids 

(B) TYPE: amino acid 
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(0) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Hodif ied-site 

(B) LOCATION: 272 

(D) OTHER INFORMATION: /note= "X is a misc. amino acid" 

(ix) FEATURE: 

(A) NAME/KEY: Hodif ied-site 

(B) LOCATION: 273 

(D) OTHER INFORMATION: /note- "X is a misc. amino acids" 

(ix) FEATURE: 

(A) NAME/KEY: Hodif ied-site 

(B) LOCATION: 274 

(D) OTHER INFORMATION: /note- "X is a misc. amino acid" 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Thr Asn Tyr Asn Tyr Asp Asn Tyr Asn Tyr Asn Gin Ser Asp Leu 
15 10 15 

He Ala Pro Gly Tyr Pro Ser Asp Pro Asn Phe Tyr lie Pro Met Tyr 
20 25 30 

Phe Ala Ala Pro Trp Val Val Val Lys Gly Cys Ser Gly Asn Ser Asp 
35 40 45 

Glu Gin Tyr Ser Tyr Gly Trp Phe Met Asp Asn Val Ser Gin Thr Tyr 
50 55 60 

Met Asn Thr Gly Gly Thr Ser Trp Asn Cys Gly Glu Glu Asn Leu Ala 
65 70 75 80 

Tyr Met Gly Ala Gin Cys Gly Pro Phe Asp Gin His Phe Val Tyr Gly 
85 90 95 

Asp Gly Asp Gly Leu Glu Asp Val Val Gin Ala Phe Ser Leu Leu Gin 
100 105 no 

Gly Lys Glu Phe Glu Asn Gin Val Leu Asn Lys Arg Ala Val Met Pro 
115 120 125 

Pro Lys Tyr Val Phe Gly Tyr Phe Gin Gly Val Phe Gly He Ala Ser 
130 135 140 

Leu Leu Arg Glu Gin Arg Pro Glu Gly Gly Asn Asn He Ser Val Ser 
145 150 155 160 

Glu He Val Glu Gly Tyr Gin Ser Asn Asn Phe Pro Leu Glu Gly Leu 
165 170 175 
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Ala Val Asp Val Asp Met Gin Gin Asp Leu Arg Cys Ser Ser Pro leu 
180 185 190 

Lys He Glu Phe Trp 7hr Ala Asn Lys Val Gly Thr Gly Gly Asp Scr 
195 200 205 

Asn Asn Lys Ser Val Phe Glu Trp Ala His Asp Lys Gly Leu Val Cys 
210 215 220 

Gin Thr Asn Val Thr Cys Phe Leu Arg Asn Asp Asn Gly Gly Ala Asp 
225 230 235 240 

Tyr Glu Val Asn Gin Thr Leu Arg Glu Lys Gly Leu Tyr Thr Lys Asn 
245 250 255 

Asp Ser Leu Thr Asn Thr Asn Phe Gly Thr Thr Asn Asp Gly Pro Xaa 
260 265 270 

Xaa Xaa Tyr lie Gly His Leu Asp Tyr Gly Gly Gly Gly Asn Cys Asp 
275 280 285 

Ala Leu Phe Pro Asp Trp Gly Arg Pro Gly Val Ala Glu Trp Trp Gly 
290 295 300 

Asp Asn Tyr Ser Lys Leu Phe Lys lie Gly Leu Asp Phe Val Trp Gin 
305 310 315 320 

Asp Met Thr 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 43 

(D) OTHER INFORMATION: /note« "X is a misc. amino acid' 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 176 

(D) OTHER INFORMATION: /note« "X is a misc. amino acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Thr Asn Tyr Asn Tyr Asp Asn Leu Asn Tyr Asn Gin Pro Asp Val 
1 5 10-15 
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Val Pro Pro Gly Tyr His Asp His Pro Asn Tyr Tyr He Pro Met Tyr 
20 25 30 

Tyr Ala Ala Pro Trp Leu Val Val Gin Gly Xaa Ala Gly Thr Ser Lys 
35 40 45 

Gin Tyr Ser Tyr Gly Trp Phe Met Asp Asn Val Ser Gin Ser Tyr Met 
50 55 60 

Asn Thr Gly Asp Thr Ala Trp Asn Cys Gly Gin Glu Asn Leu Ala Tyr 
65 70 75 80 

Met Gly Ala Gin Tyr Gly Pro Phe Asp Gin His Phe Val Tyr Gly Asp 
65 90 95 

Gly Asp Gly Leu Glu Asp Val Val Lys Ala Phe Ser Phe Leu Gin Gly 
100 105 no 

Lys Glu Phe Glu Asp Lys Lys Leu Asn Lys Arg Ser Val Met Pro Pro 
115 120 125 

Lys Tyr Val Phe Gly Phe Phe Gin Gly Val Phe Gly Ala Leu Ser Leu 
130 135 140 

Leu Lys Gin Asn Leu Pro Ala Gly Glu Asn Asn He Ser Val Gin Glu 
145 150 155 160 

lie Val Glu Gly Tyr Gin Asp Asn Asp Tyr Pro Phe Glu Gly Leu Xaa 
165 170 175 

Val Asp Val Asp Met Gin Asp Asp Leu Arg Val Phe Thr Thr Lys Pro 
180 185 190 

Glu Tyr Trp Ser Ala Asn Met Val Gly Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace (573, 

(D) OTHER INFORMATION: /note- "g is a mi so nucleic acid" 

(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace (601, "") 

(0) OTHER INFORMATION: /note= "g is a misc. nucleic acid" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



ATGACAAACT 


ATAATTA7GA 


CAATTTGAAC 


TACAATCAAC CGGACCTCAT 


CCCACCTGGC 


60 


CATGATTCAG 


ATCCTGACTA 


CTATATTCCG 


ATGTACTTTG CGGCACCATG 


GGTGATCGCA 


120 


CATGGATATC 


GTGGCACCAG 


CGACCAGTAC 


TCTTATGGAT GGTTTTTGGA 


CAATGTATCC 


180 


CAGTCCTACA 


CAAACACTGG 


CGATGATGCA 


TGGGCTGGTC AGAAGGATTT 


GGCGTACATG 


240 


GGGGCACAAT 


GTGGGCCTTT 


CGATCAACAT 


7TTGTGTATG AGGCTGGAGA 


TGGACTTGAA 


300 


6ACGTTGTGA 


CCGCAHCTC 


nATTTGCAA 


iSGCAAGGAAT ATGAGAACCA 


GGGACTGAAT 


360 


ATACGHCTG 


CAATGCCTCC 


GAAGTACGTT 


7TCGGATTTT TCCAAGGCGT 


ATTCGGAGCC 


420 


ACATCGCTGC 


TAAGGGACAA 


oTTACCTGCC 


6GCGAGAACA ACGTCTCTTT 


GGAAGAAATT 


480 


GTTGAAGGAT 


ATCAAAATCA 


GAACGTGCCA 


THGAAGGTC TTGCTGTGGA 


TGTTGATATG 


540 


CAA6ATGACT 


T6A6AGTGTT 


CACTACGAGA 


CCGGCGTTTT GGAC6GCAAA 


CAAGGTG6GG 


600 


6AAGGCG6TG 


ATCCAAACAA 


CAAGTCA6TG 


TTTGAGTGGG CACATGACAG 


GGGCCHGTC 


660 


TGCCAGACGA 


AT6TAACTTG 


CTTCTTGAAG 


AACGAGAAAA ATCCTTACGA 


AGTGAATCAG 


720 


TCAHGAGGG 


AGAAGCAGTT 


GTATACGAAG 


AGT6ATTCCT TGGACAACAT 


TGATTTTGGA 


780 


ACTACTCCAG 


ATGGGCCTAG 


CGATGCGTAC 


ATTGGACACT TAGACTACGG 


TGGTGGTGTG 


840 


GAGTGTGATG 


CACTATTCCC 


AGACTGGGGT 


C6ACCAGACG TGGCTCAATG 


GTGGGGCGAT 


900 


AACTACAAGA 


AACTAHCAG 


CATTGGTCTC 


GACnCGTAT GGCAAGACAT 


GAG 


953 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace{814. .821 , "") 

(0) OTHER INFORMATION: /note= "Each g between (and 

including) 814 and 821 is a misc. nucleic acid." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGACAAACT ACAACTACGA CAACTATAAC TACAACCAGT CAGATCTTAT TGCTCCAGGA 60 
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TATCCTTCCG 


ACCCGAACTT 


CTACATTCCC 


ATGTATTTTG CAGCACCTTG GGTAGTTGTT 


120 


AAG6GATGCA 


GTGGCAACAG 


CGAIGAACAG 


TACTCGTACG GATGGTTTAT GGATAATGTC 


180 


TCCCAAACTT 


ACATGAATAC 


TGGTGGTACT 


TCCTGGAACT GTGGAGAGGA GAACTTGGCA 


240 


TACATGGGAG 


CACAGTGCGG 


TCCATTTGAC 


CAACATTTTG TGTATGGT6A TGGAGATGGT 


300 


CTTGAGGATG 


TTGTCCAAGC 


GTTCTCTCTT 


CTGCAAGGCA AAGAGTTTGA GAACCAAGTT 


360 


CTGAACAAAC 


GTGCCGTAAT 


GCCTCCGAAA 


TATGTGTTTG GTTACTTTCA GGGAGTCTTT 


420 


GGGATTGCTT 


CCTTGTTGAG 


AGAGCAAAGA 


CCAGAGGGTG GTAATAACAT CTCTGTTTCA 


480 


GAGATTGTCG 


AAGGTTACCA 


A A/«/*A ATA A^ 

AAGCAATAAC 


TTCCCTTTAG AGGGGTTAGC CGTAGATGTG 


540 


GATATGCAAC 


AAGATTTGCG 


GTGTAGTTCA 


CCACTGAAGA TTGAATTTTG GACGGCAAAT 


600 


AAGGTAGGCA 


CCGGGGGAGA 


CTCGAATAAC 


AAGTCGGTGT TTGAATGGGC ACATGACAAA 


660 


6GCCTTGTAT 


GTCAGAC6AA 


TGTTACTTGC 


TTCTTGA6AA ACGACAACGG CG6GGCA6AT 


720 


TACGAAGTCA 


ATCAGACAH 


GAGGGAGAAG 


GGTTTGTACA C6AAGAATGA CTCACTGACG 


780 


AACACTAACT 


TCGGAACTAC 


CAAC6AC6GG 


CCGGGGG66G GGTACATT6G ACATCT6GAC 


840 


TATGGTGGCG 


GAGGGAATTG 


TGATGCACTT 


TTCCCAGATT GGGGTCGACC GGGTGTGGCT 


900 


GAATGGTGGG 


GTGATAACTA 


CAGCAAGCTC 


TTCAAAATTG GTCTGGACTT CGTGTGGCAA 


960 


GATATGACA 








969 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAHE/KEY: misc difference 

(B) LOCATION: replace (128, "") 

(D) OTHER INFORMATION: /note= "g is a misc. nucleic acid" 

(ix) FEATURE: 

(A) NAHE/KEY; miscdiffererce 

(B) LOCATION: replace (232, "") 

(D) OTHER INFORMATION: /note= "g is a misc. nucleic acid" 

(ix) FEATURE: 

(A) NAHE/KEY: misc difference 

(B) LOCATION: replice(249, "") 
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(D) OTHER INFORMATION: /note= "g is a misc. nucleic acid" 

(ix) FEATURE: 

(A) NAME/KEY: miscjifference 

(B) LOCATION: replace(526, "") 

(D) OTHER INFORMATION: /note= "g is a misc. nucleic acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



ATGACAAACT 


ACAATTACGA 


CAACTTGAAC TACAACCAAC CAGACGTCGT TCCTCCAGGT 


60 


TATCACGACC 


ATCCCAACTA 


CTACAnCCA AT6TACTACG CAGCACC6TG GTTG6TCGTT 


120 


CAGGGATGCG 


CGGGGACATC 


GAAGCAATAC TC6TACGGTT GGTTTATG6A CAATGTCTCT 


180 


CAGTCGTACA 


T6AACACTGG 


AGATAC6GCG TGGAACTGCG GACAGGAAAA CGTGGCATAC 


240 


ATGG6CGC6C 


AATAC66GCC 


ATTT6ATCAG CACrTTGTGT ATG6TGATGG AGAT6GCCTT 


300 


GAAGATGTCG 


TCAAAGCGTT 


CTCCTTTCTT CAAGGAAAGG AGTTCGAAGA CAAAAAACTC 


360 


AACAAGCGTT 


CTGTAATGCC 


TCCGAAGTAC GTGnTGGTT TCTTCCAGGG TGTTTTCGGT 


420 


GCACTTTCAC 


T6TT6AAGCA 


GAATCTGCCT GCCG6AGAGA ACAACATCTC A6TGCAA6AG 


480 


ATTGTGGAGG 


GTTACCAGGA 


TAACGACTAC CCCTTTGAAG GGCTCGC6GT A6ATGTTGAT 


540 


ATGCAAGATG 


ATCTGC6AGT 


GTTTACTACC AAACCAGAAT ATTGGTCGGC AAACATGGTA 


600 


GGCGAAG 






607 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Tyr Arg Trp Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn Ala Ala 
15 10 15 

Phe Gly Lys Pro He He Lys Ala Ala Ser Met Tyr Asn Asn Asp Ser 
20 25 30 

Asn Val Arg Arg Ala Gin Asn Asp His Phe Leu Leu Gly Gly His Asp 
35 40 45 

Gly Tyr Arg lie Leu Cys Ala Pro Val Val Trp Glu Asn Ser Thr Glu 

50 55 60 
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Arg Glu Leu T^r Leu Pro Val Leu Thr Gin Trp lyr Lys Phe Gly Pro 
65 70 75 80 

Asp Phe Asp Thr Lys Pro Leu Glu Gly Ala 
85 90 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: inisc_difference 

(B) LOCATION: replace(6, "") 
(D) OTHER INFORMATION 5 /note= 

(ix) FEATURE: 

(A) NAME/KEY: roisc_difference 

(B) LOCATION: replace(9, "") 
(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace (12, "") 
(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace(I5, 

(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace(18, 

(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME /KEY: raise difference 

(B) LOCATION: replace(21, "") 
(D) OTHER INFORMATION: /note" 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTANAANA ANGANTCNAA NGT 23 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 



•N is T OR C" 



"N IS C OR T 



"N IS C OR T 



"N IS C OR T" 



"N IS G OR A OR T OR C" 



"N IS C OR T" 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(1x) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(6, "") 

(D) OTHER INFORMATION: /note= "N IS T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(9. "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(12, 

(D) OTHER INFORMATIONS /note= "H IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(I5, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(18, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(21, "") 

(D) OTHER INFORMATION: /note» "N IS C OR T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGTANAANA ANGANAGNAA NGT 23 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(3, "") 
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(D) OTHER INFORMATION; /note= "N IS G OR A OR T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: rep1ace(6, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace(9, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: repUce(12, "") 

(0) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(ix) FEATURE; 

(A) NAME/KEY: niisc dlfference 

(B) LOCATION: replace(15, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TANCCNTCNT GNCCNCC 17 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice{3, "») 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: iiiisc_difference 

(B) LOCATION: replace(6, "") 

(0) OTHER INFORMATION: /note= "N IS 6 OR A" 

(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace(9, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc_difference 

(B) LOCATION: replace(12, "") 
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(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace(18, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 19: 
GGNCCNAANT TNTACCANTG 20 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(3, "") 

(D) OTHER INFORMATION: /note= "N IS T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace (6, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(12, •") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY; misc difference 

(B) LOCATION: replice{15, "") 

(D) OTHER INFORMATION: /note" "N IS G OR A" 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 20: 
TANCGNTGGC ANGANGT 17 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
(0) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace(3, "") 

(D) OTHER INFORMATION: /note= "N IS T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(6, "") 

(0) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(12, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(15, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TANAGNTG6C ANGAN6T 17 
(2) INFORMATION FOR SEQ ID NO: 22: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATGTACAACA ACGACTCGAA CGTTC6CAG6 GCGCAGAACG ATCATHCCT TCTTGGCGGC 60 
CACGACGGTT A 71 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Tyr Asn Asn Asp Ser Asn Val Arg Arg Ala Gin Asn Asp His Phe 
15 10 15 

Leu Leu Gly Gly His Asp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATGTACAACA AC6ACTCGAA CGTTCGCAGG GCGCAGAACG ATCATTTCCT TCTTGGT6GA 
CAT6ATGGAT ATCGCATTCT GTGCGCGCCT GTTGTGTG6G A6AATTCGAC CGAACGGAAT 
TGTACTTGCC CGTGCTGACC CAATGGTACA AAHCGGCCC 
(2) INFORMATION FOR SEQ 10 NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(i1) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Tyr Asn Asn Asp Ser Asn Val Arg Arg Ala Gin Asn Asp His Phe 
15 10 15 

Leu Leu Gly Gly His Asp Gly Tyr Arg He Leu Cys Ala Pro Val Val 
20 25 30 

Trp Glu Asn Ser Thr Glu Arg Glu Leu Tyr Leu Pro Val Leu Thr Gin 
35 40 45 

Trp Tyr Lys Phe Gly Pro 
50 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 



60 
120 
160 
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(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSj double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TACAGGTGGC AGGAGGTGH GTACACTGCT ATGTACCAGA ATGCGGCTTT CGGGAAACCG 60 

ATTATCAA6G CAGCTTCCAT 6TACGACAAC 6ACA6AAACG TTCGCGGCGC ACAGGATGAC 120 

CACTTCCnC TCGGCGGACA CGATGGATAT CGTATTTTGT GTGCACCTGT TGTGTGGGAG 180 

AATACAACCA GTCGCGATCT GTACTTGCCT GTGCTGACCA GTGGTACAAA TTCGGCCC 238 
(2) INFORMATION FOR SEQ ID NO: 27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Tyr Arg Trp Gin Glu Val Leu Tyr Thr Ala Met Tyr Gin Asn Ala Ala 
15 10 15 

Phe Gly Lys Pro lie He Lys Ala Ala Ser Met Tyr Asp Asn Asp Arg 
20 25 30 

Asn Val Arg Gly Ala G1n Asp Asp His Phe Leu Leu Gly Gly His Asp 
35 40 45 

Gly Tyr Arg lie Leu Cys Ala Pro Val Val Trp Glu Asn Thr Thr Ser 
50 55 60 

Arg Asp Leu Tyr Leu Pro Val Leu Thr Lys Trp Tyr Lys Phe Gly 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(Ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 28: 
GCTCTAGAGC ATGTTTTCAA CCCTTGCG 28 
(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 29: 
AGCTT6TTAA CATGTATCCA ACCCTCACCT TCGTGG 36 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DHA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ACAAHGTAC ATA6GTTGGG A6TGGAAGCA CCGC 34 
(2) INFORMATION FOR SEQ 10 NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

lys Asn Leu His Pro Gin His Lys Met Leu Lys Asp Thr Val Leu Asp 
15 10 15 

He Val Lys Pro Gly His 61y Glu Tyr Val 61y Trp Gly Glu Met 61y 
20 25 30 
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Gly lie Gin Phe Met Lys Glu Pro Thr Phe Met Asn Tyr Phe Asn Phe 
35 40 .45 

Asp Asn Met 61n Tyr Gin Gin Val Tyr Ala Gin Gly Ala Leu Asp Ser 
50 55 60 

Arg Glu Pro Leu Tyr His Ser Asp Pro Phe Tyr 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace{3, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: niisc dif ference 

(B) LOCATION: replace(6, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(9, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(15, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OT T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: mi scdif ference 

(B) LOCATION: rBplace(18, "") 

(D) OTHER INFORMATION: /note= "N IS 6 OR A" 

(ix) FEATURE: 

(A) NAME/KEY: mi scdif ference 

(B) LOCATION: replace(21, "") 

(D) OTHER INFORMATION: /note- "N IS C OR T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CANCANAANA TGCTNAANGA NAG 23 
(2) INFORMATION FOR SEQ ID NO: 33: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: miscjifference 

(B) LOCATION: replace(3, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: mi so difference 

(B) LOCATION: replace{6, 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(9, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replic6(15, «") 

(0) OTHER INFORMATION: /note= "N IS 6 OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(18, "") 

(D) OTHER INFORMATION: /note« "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace{21, "") 

(0) OTHER INFORMATION: /note= "N IS C OR T" 



(Xt) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CANCANAANA T6TTNAANGA NAC 23 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
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(A) NAME/KEY: misc difference 

(B) LOCATION: replice(3, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: rep1ace(6, 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: niisc_difference 

(B) LOCATION: replace(9, "") 

(D) OTHER INFORMATION: /note« "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: wise difference 

(B) LOCATION: replice(12, "") 

(D) OTHER INFORMATION: /note= "N IS 6 OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(15, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: inisc_difference 

(B) LOCATION: replace (18, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
. TANAANGGNT CNCTNTGNTA 20 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(3, "") 

(0) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME /KEY: misc_difference 

(B) LOCATION: replace (6, "") 

(D) OTHER INFORMATION: /note* "N IS G OR A OR T OR C" 
(ix) FEATURE: 
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(A) NAME/KEY: raiscdifference 

(B) LOCATION: replace(9. "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE; 

(A) NAME/KEY: misc difference 

(B) LOCATION: repTace(12, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replace(I5, 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(I8. "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TANAAN66NT CN6ANTGNTA 20 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
AAACTGCAGC TGGC6CGCCA TGGCAGGATT TTCTGAT 37 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: repUce(6, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C 
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(ix) FEATURE: 

(A) NAME/KEY: inisc_difference 

(B) LOCATION: replace(9, "") 

(0) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replice(12, "") 

(D) OTHER INFORMATION: /note= "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace(15, 

(D) OTHER INFORMATION: /note- "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(18, "") 

(D) OTHER INFORMATION: /note- "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: raise difference 

(B) LOCATION: replice(21, *") 

(D) OTHER INFORMATION: /note- "N IS C OR T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ATGACNAANT ANAANTANGA NAA 23 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replice(l, "") 

(D) OTHER INFORMATION: /note= "N IS A OR G" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace (4, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace (13, "") 

(0) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 
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(ix) FEATURE: 

(A) NAME/KEV: misc difference 

(B) LOCATION; replice(16, 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 

(IX) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(19. "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
NTGNGGCATC ATN6CNG6NA C 21 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 



(ix) FEATURE; 

(A) NAME/KEY: miscdifference 

(B) LOCATION: replace(5, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(9, "") 

(D) OTHER INFORMATION: /note- "N IS C OR T" 

(ix) FEATURE: 

(A) NAME/KEY: niisc_difference 

(B) LOCATION: replace(15, "") 

(D) OTHER INFORMATION: /note= "N IS G OR A OR T OR C 

(ix) FEATURE: 

(A) NAME/KEY: misc difference 

(B) LOCATION: replace(18, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 

(ix) FEATURE: 

(A) NAME/KEY: misc_difference 

(B) LOCATION: replace (21, "") 

(D) OTHER INFORMATION: /note- "N IS G OR A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GTCATNTCNT GCCANACNAA NTC 23 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc I36iy) 



A. Tbe indications made below relate to the cnicroorganisni referred to in tbe description 
\(4> ,line l(p 



B. IDENTIFICATION OF DEPOSIT 



Further deposiu are identified on an additional a beet 



Name of depositary instinition 

The National Collections of Industrial and Marine Bacteria Limited (NCIMB) 



Address of depositary instirution fmcluding postal cade §nd taaatry) 

23 St. Machar Drive 
Aberdeen 
Scotland 
AB2 IRY 

United Kingdom 



Date of deposit 



Accession Number 



C ADDITIONAL INDICATIONS (leave blank if tut applkahle) This information is continued on an additional sheet 



In respect of those designations in which a European patent is sought » and any 
other designated state having equivalent legislation, a sample of the deposited 
microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a 
sample to an expert nominated by the person requesting the sample, (Rule 28(4} 
EEC). 



D. DESIGNATED STATES FOR WHICH mmCATlONS AUEMAm (if the indii^lioas art not for all iesignaied^^^ 



£. SEPARATE FURNISHING OF INDICATIONS (leave blank if not eppltcabk) 



Tbe indications listed below will be submitted to the International Bureau later (specify the gen^etriature of the indications eg., ^Accession 
NamberefD^ant') 



TikM sheet was received with the internaiiona) application 



For receiving Office use only 



Auiboriud officer 




TfT. 



FonaPaMO'134(JoW I99:>) 



For International Bureau use only 



This sheet was received by the International Bureau on: 



Authorticd orOcer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule X3bis) 



A. The indicalioos made below reUte to the mtaoorgacusm ceferred to in the description 
IM' .line \t 



on page 



IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet [ j 



Name of depositary institution 
The National Collections of Industrial and fiarine Bacteria Limited (NCIMB) 



Date of deposit ^ . - 


Accession Number / r**^ 


C. ADDITIONAL INDICATIONS ({webludtifMct &pp!kaUc) This information is continued on an additional sheet 



Addcess of depositary iostitutioa (inciting pottal code Mitdcoiuary) 

23 St. Machar Drive 
Aberdeen 
Scotland 
AB2 IRY 



In respect of those designations in which a European patent is sought, and any 
other designated state having equivalent legislation, a sample of the deposited 
microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn » only by the issue of such a 
sample to an expert nominated by the person requesting the sample* (Rule 28(4) 
EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (ifiheindtcaupmsgftiuHjor^HdeapuUtdSictes) 



E. SEPARATE FURNISHING OF irWICATIONS (iea^bUtik if not appiictbic) 



The indications listed below wil I be submitted to the Intemaiional Bureau later (spectfyihegeno'ai nature of the wdicattons eg.. 'Accession 
Number tf Deposit') 



^ Ibis sbeetwas leoeived with the international application 


Authorized oCQoer /^^^^f^ 





Form PCr/»< Vi3^ (July \P^2) 



For International Bureau use only 



rn Ibis sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCr Rule I30is) 



A. The indications raade bcJow relate to the cnicroorganisra referred to tn the description 
JM- .line 



on page 



B. IDENTIFICATION OF DEPOSIT 



Furtber deposits are identiFied on a n a dditional abeet 



Name of deposiury institution 
Culture Collection of Algae and Protozoa (CCAP) 



Dateofdepoaii 


Accession Number . 

ocftf \ri%l\ 


C. ADDITIONAL INDICATIONS (leave hUnk if not tp/JU^I^ Ibis infonnation is oontiotted on an additional abeet Q 



Address of deposiury institution (wdrndimg postal code MUcmMry) 

Dunstaffnage Marine Laboratory 

P.O, Box 3 

Oban 

Argyll PA34 4AD 
Scotland United Kingdom 



In respect of those designations in which a European patent is sought, and any 
other designated state having equivalent legislation, a sa&ple of the deposited 
microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has b en 
refused or vlthdravn or is deemed to be withdrawn , only by the issue of such a 
sample to an expert nominated by the person requesting the sample. (Rule 28(4} 
EPC). 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the ideations ere not for eUdehpiaied States^ 



£. SEPARATE FURNISHING OF INDICATIONS Han^hhnk if not ^pplicabU) 



Tbe indications listed below will be submitted to the International Bureiu htcr (specify the general natutc of the imdtcatiofue.f., 'Accession 
NambercfBeposit') 



For receiving Office use only 



P?] Hits sheet was received witb tbe international application 
Amborircdoflicer /^^Zt*4m^^ 



ER 



For international Bureau use only 



Q] Ibis sheet was received by the International Bureau i 



Authorized ofTicer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(per Rule \2bis) 



A. The indications made beiow relate to the microor^nism referred to in the description 
onpa^e 1^ .line 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an addttioniUbeet Q 



Name of deposiuiy institutton 
The National Collections of Industrial and Marine Bacteria Limited (NCIMB) 



Address of deposiury institution fwcluding postal code andtouMry) 

23 St, Machar Drive 
Aberdeen 
Scotland 
AB2 IRY 

United Kingdom 



Dace of deposit _ _ ^ 

3 ccToSe<^ 



ADoesiion Number 



C* ADDITIONAL INDICATIONS (leave l^bfik if net appUcahle) This informition is oontioued on an additional abeet 



In respect of those designations in which a European patent is sought, and any 
other designated state having equivalent legislation, a sample of the deposited 
microorganism will be made available tintil the publication of the mention of the 
grant of the European patent or until the date on which the application has been 
refused or vithdrawn or is deemed to be withdravm, only by the issue of such a 
sample to an expert nominated by the person requesting the sample. (Rule 28(4) 
ETC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADt(ifihei>idieauo0uoreiioi foralliegipiaiedSiaUa} 



£. SEPARATE FURNISHING OF INDICATIONS (/one bknk ifnoi eppticabie) 

Ibe indications listed below will be submtcied lo the Intemationai Bureau laier (specify the gcntrai nature of the indications e.g., 'Acceaton 
Nmmb€rcf Deposit*} 



For receiving Office use only 



This sheet .was feoeived with the international application 



AuUiorized oOficer 




lAKER 



Form PCT/RO/i3d ^Juij 



For tntemaiional Bureau use on)y 



n This sbeei was received by the International Bureau on: 



AutbonLed offtcer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 



A. Tbc indications made beiow relate to the intcroorpnum referred to in the description 
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Qh MMS 

1. A method of preparing the sugar 1,5-D-anhydrofructosc comprising treating an a- 
1,4-glucan with the enzyme a- 1 ,4-glucan lyase characterised in that enzyme is used 

5 in substantially pure form. 

2. A method according to claim I wherein if the glucan contains links other than and 
in addition to the a-1,4- links the a-l,4-glucan lyase is used in conjunction with a 
suitable reagent that can break the other links. 

10 

3. A method according to claim 2 wherein the glucan is starch and a hydrolase, 
preferably a glucanohydrolase, is used in conjunction with the a-l,4-glucan lyase. 

4. A method according to claim 2 or claim 3 wherein the hydrolase is at least one of 
IS puUanase or isoamylase. 

5. A method according to any preceding claim wherein the a-l,4-glucan lyase is 
bound to a support or, more preferably, is in a dissolved form. 

20 6. A method according to any preceding claim wherein the enzyme is isolated from 
either a fungus, preferably Morchella costata or Morchella vulgaris, or from a 
fungally infected algae, preferably Gracilariopsis lenumeiformis or from algae alone, 
preferably GracUariopsis lemaneiformis. 

25 7. A method according to claim 6 wherein the enzyme is isolated and/or further 
purified from the fungus or from the fungally infected algae or from algae alone using 
a gel that is not degraded by the enzyme. 

8. A method according to claim 7 wherein the gel is based on dextrin or derivatives 
30 thereof, preferably the gel is a cyclodextiin - more preferably beta-cyclodextrin. 
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9. A method according to any of the preceding claims wherein the enzyme comprises 
the amino acid sequence SEQ. ID. No. I. or the amino acid sequence SEQ. ID.- No. 
2 or the amino acid sequence SEQ. ID. No. 5. or the amino acid sequence SEQ. ID. 
No. 6, or any variant thereof. 

5 

10. A method according to any preceding claim wherein the enzyme is obtained from 
the expression of a nucleotide sequence coding for the enzyme. 

11. A method according to claim 10 wherein the nucleotide sequence is a DNA 
10 sequence. 

12. A method according to claim 1 1 wherein the DNA sequence comprises a sequence 
that is the same as, or is complementary to, or has substantial homology with, or 
contains any suitable codon substitutions for any of those of, SEQ. ID. No. 3 or 

15 SEQ. ID. No. 4 or SEQ. ID. No. 7 or SEQ. ID. No. 8. 

13. Tbe method according to claim 3 or any claim dqiendent thereon wherein the 
starch is used in high concmtration - such as up to about 25% soludon. 

20 14.The method according to any one of the preoedmg claims wh^^ the substrate 
is treated with the enzyme in the presence of a buffer. 

15. The method according to any one of claims 1 to 13 wh^dn the substrate is treated 
with the enzyme in the presence of at least substantially pure water. 

25 

16. The method according to any one of the preceding claims wherein the substrate 
is treated with the enzyme m the absence of a co-factor. 

IT.The method according to any one of the preceding claims wherein the oizyme is 
30 used in combination with amylopecdn or dextrin. 
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18. A method of preparing the sugar 1,5-D'anhydrofructose comprising treating an 
(y-l,4-glucan with the enzyme o-l,4-glucan lyase characterised in that enzyme 
comprises the amino acid sequence SEQ. ID. No. 1. or the amino acid sequence 
SEQ. ID. No. 2 or the amino acid sequence SEQ. ID. No. 5. or the amino acid 

5 sequence SEQ. ID. No. 6, or any variant thereof. 

19, The sugar 1,5-D-anhydrofhictose when prepared by the method of the present 
invention. 

10 20. The use of a reagent that can increase the hydrophobicity of the reaction medium 
to increase the stability and activity of the GL enzyme. 

21. Use of AF as an anti-oxidant. 

15 22. Use of AF as a sweetener. 
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Fig. 1 . Calcoflour White staintngs revealing fungi in upper part and lower part of Gract- 
laria iemnaeformis. (108x and 294x). 
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i ig. .V The microiirapl\ slious lon^iuidina! and gracing wtuons of iwo ihin-wallcd fungal 
hypha (0 grou inii bciwccn thick walls (w) of algal cells. Noic ihylacoid membranes m 
ilic algal chloroplnsi (arrows) 
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Fig.5. Intense antisense detections with clone 2 probe are found over the fungi in 
Gracilaria lemnaeformis (294x). 
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FIGURE 8 



HFSTLAFVAP SALGASTFVG AEVRSNVRIH SAFPAVHTAT RKTNRLNVSM TALSDKQTAT AGSTDNPDGI 

OYKTYOYVGV WGFSPLSNTN WFAAGSSTPG GITDWTATMN VNFORIDNPS ITVQHPVQVQ VTSYNNNSYR 

VRFNPDGPIR DVTRGPILKQ QLDWIRTQEL SEGCDPGMTF TSEGFLTFET KDLSVIIYGN FKTRVTRKSD 

GKVIMENDEV GTASSGNKCR GLMFVDRLYG NAIASVNK NF RNDAVKOEGF YGAGEVNCK Y QDTYILERTG 

lAMTNYNYDN LNYNQWDLRP PHHDGALNPD YYIPMYYAAP WLIVNGCAGT SEQYSYGWFM DNVSQSYMNT 

GDTTWNSGQE DLAYMGAQYG PFDQHFVYGA GGGMECVVTA FSLLQGKEFF NQVLNKRSVM PPKYVFGFFQ 

GVFGTSSLLR AHMPAGENNI SVEEIVEGYQ NNNFPFEGLA VOVDHQDNLR VFTTKGEFWT ANRVGTGGDP 

NWRSVFEW AH D K6LVC0TNI TCFLRNDNE6 ODYEVNOTLR ER OLYTKNDS LT6TDFGMTD DGPSOAYIGH 

LDYGGGVECD ALFPDWGRPD VAEWWGNNYK KLFSIGLDFV WQDHTVPAHH PH KIGDDINV KPDGNWPNAD 

BESNGQYNWK TYHP OVLVTD HRYENHGREP MVTQRN IHAY TLCESTRKEG IVENADTLTK FRRSYIISRG 

GYIGNQHFGG MWVGDNSHS NYIQMMIANN INMNHSCLPL VGSDIGGFTS YDNENQRTPC TGDLMVRYVQ 

AGCLLPWFR N HYDRWIESKD HGKDYQELYM YPNEMDTLRK FVEF RYRWQE VLYTAMYONA AFGKPIIKAA 

SMYNNDSNVR RAONDHFLLG GHDGYRILCA PVVWENSTER ELYLPVLTQW YKF6PDFDTK PLEGAM NGGD 

RIYNYPVPQS ESPIFVRE6A ILPTRYTLNG ENKSLN7YTD EOPLVFEVFP LGNNRADGMC YLDDGGVTTN 

AEDNGKFSVV KVAAEODGGT ETITFTNDCY EYVFG6 PFYV RVRGAQSPSN IHVSS6AGSQ OMKVSSATSR 

AALFND6ENG DFWVDOETDS LWLKLPNVVL PDAVIT IT 
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Figure \Q. Microphotograph of a fungal hypha (f ) growing bet- 
ween algal cell walls (w) . Note grains of floridean starch (s) 
and thylakoids (arrows) in the algal cell. Bar = 2 um 
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1 AGACAGGTGC GTTTTTGTTT ATTCTATTCT GTGCGGCAGA TATGCACTCA CAAGAAACAA 
61 ATTGTACAAA TATTTCTAAT TACAGTTGTA GGTGCAGTTG AAAATCCGGT CGCACAAAGA 
121 TCATTGATGC ACAAAGATGA TAACGCCTGA TTAGTACTCA AGGTHAATT GGGTATGTGT 
181 GCGACCTCTC TTTGGCTAGC ATTACCTGAT TGGTTACAAC TGCAAATACT GCGGCAGCAA 
241 TGAGGAATGA AGTCAGCATC GATAGCTCGG CCTCATAAAA ATTGATHCA ATHTATATT 
301 CCCAGTTTTA ATCTCGAATC CTATATAATG GCCATCGTTC CCTCCTCGCC TCTTCATTCT 
361 CCTCCATCAC TCCAGCTCAG TCATCCCTCA ACTTGGCCTC CTCTGATATC TTCCGAACAA 
421 AACATCTTGT CCAATCTTTT mGAGCTAG ATCTCATTAT ACCTCC6TCA TGGCAG6ATT 
481 TTCT6ATCCT CTCAACTTTT GCAAAGCAGA A6ACTACTAC AGIGHGCGC TA6ACTGGAA 
541 G6GCCCTCAA AAAATCAHG 6AGTAGACAC TACTCCTCCA AAGAGCACCA AGHCCCCAA 
601 AAACTGGCAT G6AGTGAACT TGA6ATTC6A TGATGGGACT TTA66TGTGG TTCA6TTCAT 
661 TAGGCCGTGC GHTGGAGGG HAGATACGA CCCTGGnTC AAGACCTCTG ACGAGTATGG 
721 TGATGAGAAT A C6T6AGTTA CCCCATATGT CATTATTGGT AGC6AAAAAC ATAT6CTAAT 
781 CAACTAACGA GGCATATAfi fi AGGACAAHG TGCAAGATTA TATGAGTACT CTGAGTAATA 
841 AATTGGATAC TTATAGAGGT CTTACGTGGG AAACCAAGT6 TGAGGATTCG GGAGATTTCT 
901 TTACCTTCTC AGTAAGTGCC AGTACTGCTA TAGCTCCGCT ATATATATAA CACCACTAAC 
961 TAACTGCCCT AAATAG TCCA AGGTCACCGC CGTTGAAAAA TCCGAGCGGA CCCGCAACAA 
1021 6GTCGGCGAT GGCCTCAGAA TTCACCTATG GAAAAGCCCT TTCCGCATCC AAGTAGTGCG 
1081 CACCTTGACC CCTHGAAGG ATCCTTACCC CATTCCAAAT GTAGCCGCAG CCGAAGCCCG 
1141 TGTGTCCGAC AAGGTCGTTT GGCAAACGTC TCCCAAGACA TTCAGAAAGA ACCTGCATCC 
1201 GCAACACAAG ATGCTAAAGG ATACAGTTCT TGACATTGTC AAACCT6GAC AT6GCGAGTA 
1261 TGTGGGGTGG GGAGAGATGG GAGGTATCCA GTTTATGAAG GAGCCAACAT TCATGAACTA 
1321 TTTT A6TAA6 CCCCGAAGAG GTTCCTTATA AATTCTTGGT GGTCATTTTT ACTAACCCA6 
1381 TGTAG ACTTC GACAATATGC AATACCA6CA A6TCTATGCC CAA6GT6CTC TCGATTCTCG 
1441 CGAGCCACT G TAAGTACCGT CCTGTGGCAC 6ACTTAACCC AATAACTAAT CTTTCAACAA 
1501 fiGTACCACTC G6ATCCCTTC TATCTTGATG TGAACTCCAA CCCGGAGCAC AAGAATATCA 
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FIGURE 14 CONTINUED 



1561 CGGCAACCTT TATCGATAAC TACTCTCAAA TTGCCATCGA CTTT66AAAG ACCAACTCA6 
1621 GCTACATCAA GCTGGGAACC AGGTATGGTG GTATCGATTG TTACGGTATC AGTGCGGA'TA 
1681 CGGTCCCGGA AATTGTACGA CTTTATACAG GTCTTGTTGG ACGTTCAAAG TTGAAGCCCA 
1741 GATATATTCT CGGGGCCCAT CAAGCC TGTA A6TCCTTCCC CTCAT6AGTG ATTTATCAGA 
1801 CTT6CATAAT AAACTAACCT CGTTTTCAAA G GTTAT6GAT ACCAACAGGA AAGTGACTT6 
1861 TATTCT6TGG TCCAGCAGTA CC6TGACTGT AAATHCCAC TTGACGGGAT TCACGTC6AT 
1921 GTCGATGTTC AG GTAAATGG CCATGGTATC ATTGAAGCTT T6AGAAATGT TrTAArTGTC 
1981 TTTATAACAT TCCTAG GAC6 GCTTCAGAAC TTTCACCACC AACCCACACA CTHCCCTAA 
2041 CCCCAAAGAG ATGTTTACTA ACHGAGGAA TAATGGAATC AAGTGCTCCA CCAATATCAC 
2101 TCCTGHATC AGCATTAACA ACAGAGAGGG TGGATACAGT ACCCTCCTTG AGGGAGTTGA 
2161 CAAAAAATAC TTTATCATGG AC6ACAGATA TACCGAG6GA ACAAGT666A ATGCGAA6GA 
2221 TGTTCGGTAC ATGTACTACG GTGGTGGTAA TAAGGTTGA6 6TCGATCCTA ATGATGHAA 
2281 TGGTCGGCCA GACHTAAAG ACAACTA GTA AGTTGTTTAT TTGACTACGA TAG6TAACCC 
2341 GTAAGCGGCA TfAACATATT TGTAG TGACT TCCCCGCGAA CTTCAACAGC AAACAATACC 
2401 CCTATCATG6 T6GTGT6A6C TACG6TTATG GGAACG6TA6 TGTAAGTGAC GATATCTCAC 
2461 CAACATAATG AAATTTATAA G6ACTAACTA 6ACACAAAAA TTTGTAG fiCA GGTTTTTACC 
2521 CG6ACCTCAA CAGAAAGGAG GTTCGTATCT GGTGGGGAAT GCAGTACAAG TATCTCTTCG 
2581 ATATGGGACT GGAATTTGTG TGGCAAGACA TGACTACCCC AGCAATCCAC ACATCATATG 
2641 GAGACATGAA AGGGTTGCCC ACCCGTCTAC TCGTCACCTC AGACTCCGTC ACCAATGCCT 
2701 CTGAGAAAAA GCTCGCAATT GAAACTTGGG CTCTCTACTC CTACAATCTC CACAAAGCAA 
2761 CTTGGCATGG TCHAGTCGT CTCGAATCTC GTAA6AACAA ACGAAACTTC ATCCTCGGGC 
2821 GTCGAAGHA TGCCGGAGCC TATCGTmG CTGGTCTCTG GACTGGGGAT AATGCAAGTA 
2881 ACTGGGAAn CTGGAAGATA TCGGTCTCTC AAGHCTTTC TCTGGGCCTC AATG6TGTGT 
2941 GCATCGCGGG GTCTGATACG GGTGGnTTG AACCCTACCG TGATGCAAAT GGGGTCGAGG 
3001 AGAAATACTG TAGCCCAGAG CTACTCATCA GGTGGTATAC IGGHCATTC CTCTTGCCGT 
3061 GGCTCAGGAA CCATTATGTC AAAAA6GACA GGAAATGGTT CCA G6TAATC TATCCTTTCT 
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FIGURE 14 CONTINUED 



3121 TATCTTT6AA GCftTTGAAGA TACTAA6ATA TAATCTAG GA ACCATACTCG TACCCCAAGC 
3181 ATCTTGAAAC CCATCCAGAA CTCGCAGACC AAGCATGGCT CTATAAATCC GTTTTGGAGA 
3241 TCTGTAGGTA CTATGTGGAG CT7AGATACT CCCTCATCCA ACTACTTTAC GACTGCATGT 
3301 TTCAAAACGT AGTCGAC6GT ATGCCAATCA CCAGATCTAT 6 GTATGTATT CTACCCTAGG 
3361 CTTCCAGAGC AACATATGCT AACCAATTGA ACCTGGGTTT CTAG CTCTTG ACCGATACTG 
342] AGGATACCAC CTTCTTCAAC GAGAGCCAAA AGTTCCTCGA CAACCAATAT ATGGCTGGTG 
3481 ACGACATTCT TGTTGCACCC ATCCTCCACA GTCGCAAAGA AATTCCAGGC GAAAACAGAG 
3541 ATGTCTATCT CCCTCTTTAC CACACCTGGT ACCCCTCAAA HTGAGACCA TG6GAC6ATC 
3601 AAGGAGTCGC HIGGGGAAT CCTGTCGAAG GTGGTAGTGT CATCAATTAT ACTGCTAGGA 
3661 TTGHGCACC CGAGGAHAT AATCTCTTCC ACAGCGTGGT ACCAGTCTAC GTTAGAGAGG 
3721 GTAAGCA6TA AAATAATCTC nCCCAGTTT CAAATACATT TAGCTAGTAG CTAACGCTAT 
3781 GAACCTACAG GTGCCATCAT CCCGCAAATC GAAGTACGCC AATGGACTGG CCAGGGGGGA 
3841 GCCAACCGCA TCAAGTTCAA CATCTACCCT GGAAAGGATA A6 GTAAAATT CAAT6ATCAC 
3901 CCTGCATCTA HCCATCGCT GSTTTTCnT ACCCTTACTG ACTTCATTCC TCAAAATACA 
3961 SGA6TACTGT ACCTATCHG ATGAT6GTGT TAGCCGTGAT A6TGC6CC6G AA6ACCTCCC 
4021 ACA6TACAAA GA6ACCCACG AACA6TCGAA GGTTGAA6GC GCGGAAATCG CAAAGCA6AT 
4081 TGGAAAGAA6 ACGGGHACA ACATCTCAGG AACCGACCCA GAAGCAAAGG GTTATCACCG 
4141 CAAAGTTGCT GTCACACA AG TAATACCGCC CTT6ACTTGT ATCACHCCT GACATCATGC 
4201 TAATATHCT CTGTTTACCT CAAAG ACGTC AAAAGACAAG AC6C6TACTG TCACTATT6A 
4261 GCCAAAACAC AATGGATACG ACCCTTCCAA AGAGGTGGGT GATTATTATA CCATCATTCT 
4321 TTGGTACGCA CCAGGTTTCG ATGGCAGCAT CGTCGATGTG AGCAAGACGA CTGT6AATGT 
4381 TGAGGGTGGG GT6GAGCACC AAGITTATAA GAACTCC6AT HACATACGG TTGTTATCGA 
4441 CGTGAAG6AG GT6ATCGGTA CCACAAAGAG CGTCAAGATC ACATGTACTG CCGCTT/I/IGG 
4501 TCTTTTCnG GGGGCGGGAG GCGAGACCTT CGAAATGTAT ACGGGAGTGG TAACTCCGGG 
4561 AAAAT6GTGA TAT6GGGGAT CAAGTTGGAG GGGAATCTGT TTAnTCTTT ATTTCTTTAT 
4621 TTACTG6ATT GGAAAATAGG GAGCACAGTT CTGACTGGAT TGGTTTGATT GTTGGCCTCT 
4681 ACG6GTTCTC TTTACTTTGT CT6GAAATCC AATTTATT6T TATGC6 



SUBSTITUTE SHEET (RaE 26) 



PCr/EP94/03397 



FIGURE 15 





10 

1 


20 


30 

> 


40 

i 


50 

t 


60 

* 1 


1 


1 

ATGCAGGCAA 


1 

CGACAGGCGT 


1 

TTTTTGTTTT 


1 

ATCCGCAGAG 


1 

GTGCAGCAGC 


1 

AGGAAACAAA 


61 


CCATACAAAC 


ATTCCTTGAC 


GCGGTTTTAG 


GTGCAGTTAA 


GGCCCGGGCG 


CACCAAGAAC 


121 


ATTGATGTAC 


nGGTCTAAA 


AAAGATCATA 


ATACCCGATT 


AGTGTTCATG 


GTTTGATTGG 


181 


GTCTAAGTAC 


AAGTTTTACA 


6AGTTCAGCT 


TAGTTCATTG 


TTCGAAACTA 


CCAATATCAC 


241 


ACCTAT6CCT 


GCTGGCATTG 


ATAGCTC6GC 


TTGTGAAAGC 


TGAHACAAT 


CTTACATTTC 


301 


TGAHTAATA 


TCGGACTGAT 


CTATATATAA 


GGGTCATCAT 


TTCCTCTCC6 


CCinTGGTT 


361 


CTCmCATC 


ACCCCA6CCC 


AATCATCACC 


GTTGGCCTTT 


ACTTCTCTCT 


TCCGTTGATA 


421 


TTTTCTCGAC 


AAAACATCTT 


GTCCACT6TT 


AGGCTAGCTC 


CCAGAATTAT 


CCCTCCAACA 


481 


T6GCAGGATT 


ATCCGACCCT 


CTCAAHTCT 


6CAAAGCAGA 


GGACTACTAC 


GCTGCTGCCA 


541 


AAGGCTGGAG 


TGGCCCTCAG 


AAGATCATTC 


GCTATGACCA 


GACCCCTCCT 


CAGGGTACAA 


601 


AAGATCCGAA 


AAGCTGGCAT 


GCGGTAAACC 


TTCCTTTCGA 


TGACGGGACT 


ATGTGTGTAG 


661 


TGCAAHCGT 


CAGACCCTG7 


GrrTGGAGGG 


nAGATATGA 


CCCCAGTGTC 


AAGACTTCTG 


721 


ATGAGTACGG 


CGATGAGAAT 


AC6T6GGTCG 


CCCAGTCAAT 


TAACTATGCC 


GCTAGTGAn 


781 


ATG6AAAGCT 


TCTGCTAACC 


GATCAATGAG GCAT6TAGGA GGACTATTGT ACAAGACTAC 



841 ATGACTACTC TGGHGGAAA CTTGGACATT TTCAGAGGTC HACGTOGGT TTCTACGHG 
901 GAGGATTCGG GCGAGTACTA CACCTTCAAG GCAAGCCTCA GTGTTATATC TCGAATATAT 
961 TATATATCAC AACAAACTAA CTAGTCATAC A6T CCGAAGT CACTGCCGTG GACGAAACCG 
1021 AACGGACTCG AAACAAGGTC GGCGACGGCC TCAAGATTTA CCTATGGAAA AATCCCTTTC 
1081 GCATCCAGGT AGTGCGTCTC TTGACCCCCC 7GGTGGACCC TTTCCCCATT CCCAACGTAG 
1141 CCAATGCCAC AGCCCGTGTG GCCGACAA6G TTGTTT6GCA GACGTCCCC6 AAGACGTTCA 
1201 GGAAAAACTT GCATCCGCAG CATAAGATGT TGAAGGATAC AGnCHGAT ATTATCAAGC 
1261 CGGGGCACGG AGAGTATGTG G6TTGGGGAG AGATGGGAGG CATCGAGHT ATGAAGGAGC 
1321 CAACATTCAT GAATTATTTC AGTAA6CTCT TGAAAGATTT CCTATCTCTT 6ACGGTCGTT 
1381 TTTGCTAAGG AAACTGTAG A CHTGACAAT ATGCAATATC AGCAGGTCTA TGCACAAGGC 
1441 GCTCTTGATA GTCGTGAGCC G HGTAAGTA ACGTCCTGTG ACATGTCATG ATTACAGTAA 
1501 CTGATCGTTC AATAAG GTAT CACTCTGATC CCTTCTATCT CGACGTGAAC 7CCAACCCAG 
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FIGURE 15 CONTINUED 



1561 AGCACAAGAA CATTACGGCA ACCTTTATCG ATAACTACTC. TCAGATTGCC ATCGACTTTG 
1521 GGAAGACCAA CTCAGGCTAC ATCAAGC7GG GTACCAGGTA TGGCGGTATC GATTGTTA'CG 
1681 6TATCAGCGC GGATACGGTC CCGGAGATTG TGCGACTTTA TACTG6ACTT GTTGGGCGTT 
1741 CGAAGTTGAA GCCCAGGTAT ATTCTCGGAG CCCACCAAGC TT GTAAGCCC GCCCCCTTTA 
1801 CGATGCATTT ATTA6GG6TC CACAGACTAA ACTTGTTCCA AAG GTTATGG ATACCAGCAG 
1861 GAAAGTGACT TGCAT6CTGT TGTTCAGCAG TACCGT6ACA CCAAGT7TCC GCTTGATGGG 
1921 TTGCATGTCG ATGTC6ACTT TCA GGTAAAT GGCCCAGGTA TC6TT6AAGC TTTGGAGA/^J 
1981 GCTAftTTGTG CTCGTAAAAC TTTAAG 6ACA ATUCAGAAC GTTTACCACT AACCCGATTA 
2041 CGTTCCCTAA TCCCAAAGAA ATGTnACCA ATCTAAGGAA CAATGGAATC AAGTGHCCA 
2101 CCAACATCAC CCCT6TTATC AGTATCAGAG ATCGCCCGAA TGGGTACAGT ACCCTCAAT6 
2161 AGGGATAT6A TAAAAAGTAC HCATCATCG ATGACAGATA TACCGAGGGG ACAAGTGGGG 
2221 ACCCGCAAAA TGTTC6ATAC TCTTnTACG GCG6T66GAA CCCGGTT6AG GTTAACCCTA 
2281 AT6ATGTTTG GGCTCGGCCA GACnTGGAG ACAATTA GTA AGTTACTCAA TAGGCTACTT 
2341 GAGATAHCT GTAGGTG6CA TTAACAC6AC TATAG TfiACT TCCCTACGAA CTTCAACTGC 
2401 AAAGACTACC CCTATCATGG T6GTGTGA6T TAC6GATATG 6GAATG6CAC TGTAAGTGAT 
2461 AATAAGTCAT AAATACAACG TAATTCATGfi AGACTAATCA GTGGTAAATG AATTTTAG CC 
2521 AGGHACTAC CCTGACCTTA ACAGAGAGGA GGTTCGTATC TGGTGGGGAT TGCA6TACGA 
2581 GTATCTCTTC AATATGGGAC TAGAGTHGT ATGGCAAGAT ATGACAACCC CAGCGATCCA 
2641 TTCATCATAT GGAGACATGA AAGGGTTGCC CACCCGTCTG CTCGTCACCG CCGACTCAGT 
2701 TACCAATGCC TCTGAGAAAA AGCTCGCAAT TGAAAGTT66 GCTCTTTACT CCTACAACCT 
2761 CCATAAAGCA ACCTTCCACG GTCTTGGTCG TCHGAGTCT CGTAAGAACA AACGTAACTT 
2821 CATCCTCGGA CGTGGTAGTT ACGCCGGT6C CTATCGTHT GCTGGTCTCT GGACTGGAGA 
2881 TAACGCAA6T ACGT6G6AAT TCTGGAAGAT HCGGTCTCC CAAGnCTTT C7CTAGGTCT 
2941 CAATGGTGTG TGTATA6C6G GGTCTGATAC GGGTGGITTT GAGCCC6CAC GTACTGAGAT 
3001 TGGGGA6GA6 AAATATTGCA GTCC6GA6CT ACTCATCAG6 T66TATACTG GATCATTCCT 
3061 TTTGCCATGG CTTAGAAACC ACTACGTCAA GAAGGACAGG AAATGGTTCC A GGTAATATA 
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FIGURE 15 CONTINUED 

3121 CTCTTTCTGG TCTCTGAGTA TCGAAGACGC TAAGA CAA TA TAG GAACCAT ACGCGTACCC 
3181 CAAGCATCTT GAAACCCATC CAGAGCTCGC AGATCAAGCA TGGCTTTACA AATCTGTTCT 
3241 AGAAATTTGC A6ATACTG66 TAGAGCTAAG ATATTCCCTC ATCCAGCTCC TTTAC6ACTG 
3301 CATGTTCCAA AACGTGGTCG ATGGTATGCC ACTTGCCA6A TCTAT GGTAT 6CATTTTATC 
3361 CGTCTCCTTT CACGATAATG CACCAGTCTA ACC6AATTTT CTTTTAG CTC TT6ACCGATA 
3421 CTGAGGATAC GACCTTCHC AATGAGAGCC AAAAGTTCCT CGATAACCAA TATATGGCTG 
3481 GTGAC6ACAT CCHGTAGCA CCCATCCTCC ACAGCC6TAA CGAGGTTCCG GGAGAGAACA 
3541 GAGATGTCTA TCTCCCTCTA HCCACACCT G6TACCCCTC AAACTT6A6A CCGTGGGAC6 
3601 ATCA666AGT CGCHTAGGG AATCCTGTCG AA6GT6GCAG CGHATCAAC TACACTGCCA 
3661 GGAHGHGC CCCAGAGGAT TATAATCTCT TCCACAACGT GGTGCCGGTC TACATCAGA6 
3721 AG GGTAAGCG ATGGAATAAT TTCTTGCAA6 HCCAGATAC AAGTGGHAC TGACACCTTA 
3781 AACCAG GT6C CATCAHCCG CAAAHCAGG TACGCCAGTG GATTGGCGAA 6GAGGGCCTA 
3841 ATCCCATCAA GTTCAATATC TACCCTGGAA AGGACAA GGT ATAHCTCCA TGACTATC6C 
3901 GCATTTATTC THCTCTACT CfiCACTAACT TCATCTGAAT ATAG GAGTAT GTGACGTACC 
3961 neATGATGG TGHAGCCGC GATA6T6CAC CAGATGACCT CCC6CA6TAC CGCGAGGCCT 
4021 ATGAGCAAGC GAAGGTCGAA GGCAAAGAC6 TCCAGAAGCA ACHGCGGTC ATTCAAGGGA 
4081 ATAAGACTAA TGACHCTCC GCCTCCGGGA HGATAAGGA GGCAAAGGGT TATCACCGCA 
4141 AAGTHCTAT CAAACA GGTA CATGATTTCA TCTTCCTTTT HCGCAGTCA CTATTATATC 
4201 ATCCTAACAT TGCTTCTCTT ATTTAAAAG 6 AGTCAAAAGA CAAGACCCGT ACTGTCACCA 
4261 TTGA6CCAAA ACACAACGGA TACGACCCCT CTAAGGAAGT TGGTAATTAT TATACCATCA 
4321 TTCnTGGTA CGCACCGGGC TTTGACGGCA GCATCGTCGA TGTGAGCCAG GCGACCGTGA 
4381 ACATCGAGGG CGG6GTGGAA TGCGAAATTT TCAAGAACAC CGGCTTGCAT ACGGTTGTAG 
4441 TCAACGTGAA AGAGGTGATC 6GTACCACAA AGTCCGTCAA GATCACTTGC ACTACCGCTT 
4501 AGMCUm TATGAGGGGT ATATGGGAGT GGCAGCTCAG AAATTTGGGA AGCTTCTGGG 
4561 TAnCCTTTT GrTTAmAC mTTTATTG AATCGACCAA TACGGGTGGG AHCTCTCTG 
4621 GTnTTGTGA GGCTATGTn TACTTGGTCT GAAAATCAAA TTCGTTCTCA 
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HC - MAGFSDPLNFCKAEDYYSVALDWKGPQKIIGVDTTPPKSTKFPKNWHGVN -50 



MV - MAGLSDPLNFCKAEDyVAAAKGWSGPQKIIRYDQTPPQGTKDPKSWHAVN -50 

MC - LRFDOGTLGVVQFIRPCVWRVRYDPGFKTSDEYGDENTRTIVQDYMSTLS -100 



MV - LPFDDGTMCVVQFVRPCVWRVRYDPSVKTSDEYGDENTRTIVQOYHTTLV -100 

MC - NKLDTYRGLTWETKCEDS6DFFTFSSKVTAVEKSERTRNKVGDGLRIHLW -150 



••••••••••• • 



MV - GNLOIFRGLTWVSTLEDSGEYYTFKSEVTAVDETERTRNKVGDGLKIYLW -150 

MC - KSPFRIQVVRTLTPLKDPYPIPNVAAAEARVSDKVVWQTSPiaFRKNLHP -200 

MV - KNPFRIQVVRLLTPLVDPFPIPNVANATARVADKVVWQTSPKTFRKNLHP -200 

MC - QHKMLKOTVLDIVKPGHGEYVGMGEMGGIQFHKEPTFMNYFNFONMQYQQ -250 

HV - QHKNLK0TVL0IIKPGH6EYV6WGEN6GIEFMKEPTFHNYFNF0NHQYQQ -250 

MC - VYAQGALDSREPLYHSDPFYLDVNSNPEHKNITATFIDNYSQIAIDFGKT -300 

MV - VYAQGALDSREPLYHSDPFYLDVNSNPEHKNITATFIDNYSQIAIDFGKT -300 

MC - NSGYIKLGTRYGGIDCYGISADTVPEIVRLYTGLVGRSKLKPRYILGAHQ -350 

MV - NSGY1KLGTRYGG1DCYGISADTVPEIVRLYTGLVGRSKLKPRYIL6AHQ -350 

MC - ACYGYQQESDLYSVVQQYRDCKFPLDGIHVDVDVQDGFRTFTTNPHTFPN -400 

HV - ACYGYQQESDLHAVVQQYRDTKFPLDGLHVDVDFQDMFRTFnHPITFPN -400 

MC - PKEMFTMLRNNGIKCSTNITPVISINNREGGYSTLLEGVOKKYFIMDDRY -450 

MV - PKEMFTNLRNNGIKCSTNITPVISIRDRPNGYSTLNEGYDKKYFIHDDRY -450 

MC - TEGTSGNAKOVRYHYYGGGNKVEVDPNDVNGRPDFKDNYDFPANFNSKQY -500 



MV - TEGTSGDPQNVRYSFYG6GMPVEVNPN0VWARPDF6DNY0FPTNFNCKDY -500 

MC - PYHGGVSYGY6N6SAGFYPDLNRKEVR1WW6HQYKYLF0MGLEFVWQDHT -550 

MV - PYHGGVSYGYGNGTPGYYPOLNREEVRIWWGLQYEYLFNHGLEFVWQDMT -550 

MC - TPAIHTSYGOMKGLPTRLLVTSDSVTNASEKKLAIETWALYSYNLHKATW -600 

MV - TPAIHSSYGDMKGLPTRLLVTAOSVTNASEKKLAIESMALYSYNLHKATF -600 
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MC - HGLSRLESRKNKRNflLGRGSYAGAYRFAGLWTGDNASNWEFWKISVSQV 650 



HV - HGLGRLESRKNKRNFILGRGSYAGAYRFAGLWTGDNASTWEFWKISVSQV -650 

HC - LSLGLNGVCIAGSDTGGFEPYRDANGVEEKYCSPELLIRWYTGSFLLPWL -700 



MV - LSLGLNGVCIAGSDTGGFEPAR-TEIGEEKYCSPELLIRWYTGSFLLPWL -699 

MC - RNHYVKKORKMFQEPYSYPKHLETHPELADQAWLYKSVLEICRYYVELRY -750 



MV - RMHYVKKDRKWFQEPYAYPKHLETHPELAOQAWLYKSVLEICRYHVELRY -749 

MC - SLIQLLYDCMFQNVVDGMPITRSMLLTDTEDnFFNESQKFLDNQYHAGD -800 

MV - SLIQLLYOCMFQNVVOGMPLARSHLLTOTEDTTFFNESQKFLDNqYMAGO -799 

MC - 01LVAPILHSRKEIPGENRDVYLPLYH1VYPSNLRPWDDQGVALGNPVEG -850 

MV - DILVAPILHSRNEVPGENRDVYLPLFHTWYPSNLRPWDDQGVALGNPVEG -849 

MC - 6SVINYTARIVAPEDYNLFHSVVPVYVREGAIIPQIEVRQUT6QGGANRI -900 

NV - GSVINYTARIVAPEDYNLFHNVVPVYIREGAIIPQIQVRQWIGEGGPNPI -899 

MC - KFNIYPGKOKEYCTYLDOGVSROSAPEOLPQYKETHEQSKVEGAEIAKQI -950 

MV - KFNIYPGKDKEYVTYLDDGVSRDSAPDDLPQYREAYEQAKVEGKDVQKQL -949 

MC - G KKTGYNISGTDPEAKGYHRKVAVTQTSKDKTRTVTIEPKHNGYD -995 

MV - AVIQGNKTNDFSASGIDKEAKGYHRKVSIKQESKDKTRTVTIEPKHN6YD -999 

HC - PSKEVGDYYTIILWYAPGFDGSIVDVSKnVNVEGGVEHQVYKNSDLHTV -1045 

MV - PSKEVGNYYTIILWYAPGFOGSIVOVSQATVNIEGGVECEIFKNTGLHTV -1049 

MC - VIDVKEVIGTTKSVKITCTAA -1066 

MV - VVNVKEVIGTTKSVKITCnA -1070 
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MAGFSDPLNF C KAEOYYSVA LDWK GPOKII 6VDTTPPKS1 KFPKNWH6VN LRFDDGTLGV VQFIRPCVWR 

VRYDPGFKTS OEYGDENTRT IVQDYMSTLS NKLOTYRGLT WETKCE0S6D FFTFSSKV.TA VEKSERTRNK 

VGDGLR IHLW KSPFRI QVVR TLTPLKDPYP IPNVAAAEAR VSDKVVWQTS PKTFRKNLHP OHKMI KDTVI 

DIVKPGHGEY VGW6FMGGIQ FMKEPTFHNY FNFDNMQYQQ VYAQGALDSR EPLYHSDPFY IDVNSNPEHK 

NIIATFIONY SQIAIDFGKT NSGYI KLGTR YGGIDCYGIS ADTVPEIVRL YTGLVGRSK L K PRYILGAHQ 

ACYGYQOESD LYSVVOQYRD C KFPLD6IHV DVDVQDGFR7 FTTNPHTFPN PKEMFTNLRN NGI KCSTNIT 

PVISINNREG GYSTLLEGVD K KYFIMDDRY TEGTS6NAKD VRYMYYGGGN KVEVDPNDVN GRPDFKDNYD 

FPANFNSKQY PYH6GVSYGY GN6SAGFYPD LNRKEVRIHM GHQYKYLFDM GLEFVWQDMT TPAIHTSYGD 

MKGLPTRLLV TSDSVTNASE KKLAIETVAL YSYNLHKATU HGLSRLESRK NKRNFIL6RG SYAGAYRFAG 

LMT60NASNU EFWKISVSQV LSL6LNGVCI AGSOTGGFEP YRDANGVEEK YCSPELLIRW YTGSFLLPWL 

RNHYVKKORK WFQEPYSYPK HLETHPELAD QAWLYKSVLE ICRYYVELRY SLIQLLYDCM FQNVVDGMPI 

TRSMLLTOTE OnFFNESQK FLDNQYMAGO DILVAPILHS R KEIPGENRD VYLPLYHTWY PSNLRPWDDQ 

6VALGNPVE6 GSVI NYTA RI VAPEDYNLFH SVVPVYVREG AIIPQIEVR Q WTGQGGANRI KFNIYPGKOK 

EYCTYLDDGV SRDSAPEDLP QYKETHEQSK VEGAEIAKQI GKKTGYNISG TDPEAKGYHR KVAVTQTSKD 

KTRTVTIEPK HNGYDPSK EV fiPYYTIILWY APGFDGSIVD VSK TTVNVEG GVEHQVYKNS DLHTVVIDVK 

EVISTTKSVK ITCTAA 
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FIGURE 18 ^/o-t. 

MAGLSDPLNF RKAEDVYAAA KGWSGPQKII RYDQTPPQGT KDP KSWHAVN LPFDD 6TMCV VQFVRPCVWR 

VRYDPSVK TS DEYGDENTRT IVO[>YMTTLV GNLDIFRGL T WVSTLEDSGE YYTFKSEVTA VDETERTRNK 

VGD6LKIYLW KNPFRIQVVR LLTPLVDPFP IPNVANATAR VADKVVWQTS PKTFRKNLHP QHKMLKDTVL 

DIIKPGHGEY VGWGEMGGIE FM KEPTFMNY FNFDNMQYOQ VYAQGALDSR EPLYHSDPFY LD VNSNPFHK 

NITATFIDNY SQIAIDFGKT NS6YIKLGTR YGGIDCYGIS AOTVPEIVRL YTGLVGRSKL K PRYILGAHQ 

ACYGYQOESD LHAVVQQYRD TKFPLDGLHV DVDFQDNFRT FTTNPITFPN PKEMFTNLRN NGIKCSTNIT 

PVISIRDRPN 6YSTLNEGYD KKYFIHODRY TEGTSGDPON VRYSFYG6GN PVEVNPNDVW ARPDFG DNYD 

FPTNFNC KDV PYHGGV SYGY GNGTPGYYPD LNREEVRIWW GLQYEYLFNM GLEFVWQDHT TPAIHSSYGD 

HKGLPTRLLV TAOSVTNASE KKUIESWAL YSYNLHKATF HGLGRLESRK NKRNFILGRG SYAGAYRFAG 

LVfTGDNASTW EFWKISVSQV LSLGLNGVCI AGSDT6GFEP ARTEIGEEKY CSPELLIRUY TGSFLLPULR 

NHYVKKDRKW FQEPYAYPKH LETHPELADQ AWLYKSVLEI CRYWVELRYS LIQLLYDCMF QNVVDGMPLA 

RSMLLTOTEO TTFFNESO KF LDNQYHAGDD ILVAPILHSR NEVPGENRDV YLPLFHTWYP SNLRP WDDOG 

VALGNPVEGG SVINYTARIV APEDYNLFHN VVPVYIREGA IIPQIQVRQW IGEGGPNPIK FNIYPGKDKE 

YVTYLODGVS RDSAPODLPQ YREAYEQAKV EGKOVQKQLA VIQGNKTNOF SASGIDKEAK GYHRKVSIKQ 

ESKDKTRTVT lEPKHNGYDP SKEVGNYYTI ILWYAPGFDG SIVOVSQATV NIEGGVECEI FKNT6LHTVV 

VNVKEVIGTT KSVKITCnA 
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